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PREFACE 


This  paper  was  prepared  under  an  IDA  Central  Research  Project  titled 
“Application  Theory  of  Target  Recognition.” 

The  authors  wish  to  thank  two  of  our  colleagues,  Amnon  Dalcher  and  Michael 
Tuley,  who  have  reviewed  this  paper  and  made  some  valuable  suggestions.  In  particular, 
Dr.  Dalcher  suggested  that  the  strange  behavior  being  observed  at  low  SNRs  for  the 
noncoherent  case  could  be  due  to  the  need  to  employ  a  square-law  detector  at  low  signal- 
to-noise  levels.  This  questioning  led  to  the  revised  formulation  presented  in  Section  V.C. 
Mr.  Tuley  asked  whether  our  use  of  the  Brennan  and  Reed  criterion,  while  maximizing 
SNR,  would  also  lead  to  maximum  discrimination  between  two  targets;  Dr.  Dalcher 
described  the  criterion’s  relationship  to  the  discriminability  criteria  test.  Mr.  Tuley  also 
provided  support  to  our  concern  about  the  feasibility  of  being  able  to  make  accurate 
phase  measurements  at  low  SNR  values. 
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I.  INTRODUCTION 


Target  identification  remains  a  challenging  problem  for  air  combat,  due  often  to 
the  lack  of  an  adequate  number  of  ways  to  detennine  identities  of  all  targets.  High  range 
resolution  (HRR)  radar  offers  a  potential  additional  source  of  target  identification 
information  [1],  HRR  employs  a  high-bandwidth  radar  signal  to  find  the  locations  and 
relative  intensities  of  the  radar  scatterers  along  the  longitudinal  axis  of  a  target.  The  range 
resolution,  Ar,  is  given  by  Ar  ~  c  /  2B,  where  c  is  the  speed  of  light  and  B  is  the  radar’s 
bandwidth.  For  example,  a  radar  bandwidth  of  150  MHz  provides  1  m  range  resolution. 
A  radar  with  this  bandwidth  could  collect  30  separate  radar  cross-section  measurements 
along  the  length  of  a  30  m  long  aircraft.  One  form  of  HRR  involves  merely  detennining 
the  distance  between  the  first  and  last  return  from  a  target  to  discern  its  length  and 
thereby  yield  a  simple  target  classification  (e.g.,  fighter,  bomber,  cruise  missile).  A  more 
sophisticated  approach  involves  measuring  each  of  the  returns  along  the  target’s  length 
and  comparing  the  locations  and  relative  intensities  of  the  observed  scatterers  to 
prestored  target  templates.  The  second  approach  typically  produces  more  detailed  target 
classification  (e.g.,  F-15,  MiG-29). 

Range-Doppler-imaging  (RDI)  radar  offers  an  alternative  way  to  use  radar 
measurements  to  classify  targets.  Synthetic  aperture  radar  (SAR)  involves  an  airborne 
radar  viewing  a  ground  target  from  various  angles  (as  the  radar  flies  by  or  around  the 
target)  to  create  a  two-dimensional  image  of  the  target.  RDI,  on  the  other  hand,  uses  a 
relatively  stationary  radar  to  observe  an  airborne  target  from  different  angles  as  the  target 
maneuvers  or  flies  by  the  radar.1  As  with  SAR,  RDI  forms  a  two-dimensional  image  of 
the  target  by  employing  HRR  techniques  to  detect  scatterers  in  the  radial  direction  from 
the  radar  and  observing  relative  Doppler  shifts  to  detennine  the  cross-range  position  of 
the  scatterers  (see  Figure  1,  which  illustrates  relative  velocity  differences  as  a  function  of 
distance  from  the  centroid  of  the  target  and  rate  of  apparent  target  rotation). 


1  The  stationary  radar  could  correspond  to  a  fixed  ground  radar  site.  Alternatively,  the  “stationary”  radar 
could  be  on  a  moving  aircraft,  but  all  the  target  measurements  would  be  made  with  respect  to  a  moving 
coordinate  system  that  moves  with  the  radar.  Also,  if  the  target  is  maneuvering  in  both  a  horizontal 
and  vertical  plane,  then  a  3-D  image  might  be  formed  [2],  but  we  will  not  consider  this  3-D  case  in  this 
paper. 
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Although  a  two-dimensional  image  would  surely  be  more  recognizable  by  a 
human,  we  wish  to  find  an  analytical  basis  for  quantifying  the  inherent  differences 
between  target  identification  based  on  one-dimensional  and  two-dimensional  target 
signatures.  Our  approach  starts  by  representing  a  set  of  expected  targets  (either  one¬ 
dimensional  or  two-dimensional)  as  templates,  typically  multidimensional  complex 
vectors. 


Radar 


.  Aircraft  path 

i  Aircraftvelocityvector 

•4—  Relative  velocity  due  to  aircraft  turning,  <x>  A CR 


Figure  1.  Basis  for  Cross-Range  Measurement 


Applying  a  matched-filter  computational  process  (Sections  III  and  IV),  the  vector 
templates  are  compared  with  a  target  return  signal  (corrupted  by  noise)  to  produce  a  set 
of  scalar  test  statistics.  The  test  statistics  are  compared  and  their  associated  probabilities 
of  observation  are  estimated.  These  probabilities  represent  the  probabilities  of  correct  and 
incorrect  target  classification.  The  probabilities  are  typically  presented  in  the  fonn  of 
confusion  matrices  (Chapter  II).  The  details  of  these  calculations  depend  on  the  statistics 
chosen  and  on  the  assumptions  regarding  the  characteristics  of  the  interference  (noise, 
jamming,  clutter)  associated  with  the  radar  measurements.  These  noise  characteristics 
govern  the  degree  to  which  correct  and  incorrect  classifications  can  occur.  Some  simple 
example  calculations  are  presented  in  Chapter  V.  The  methodology  reported  herein 
should  provide  insight  into  the  underlying  issues  associated  with  target  identification. 
Whereas  these  analyses  are  purely  theoretical,  in  Chapter  VI  we  discuss  ways  to  extend 
the  analyses  to  include  real-world  issues  associated  with  target  identification. 
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II.  CONFUSION  MATRICES 


Confusion  matrices  provide  a  convenient  format  for  representing  the  degree  to 
which  an  identification  process  can  correctly  or  incorrectly  classify  targets.  If  confusion 
matrices  exist  for  multiple  sensors,  then  the  confusion  matrices  can  be  combined 
relatively  easily  to  evaluate  the  result  of  a  fused-identification  (ID)  decision  [3],  We  will 
define  the  rows  of  the  matrix  to  correspond  to  true  target  types;  the  columns  then 
represent  the  possible  outputs  of  the  identification  sensor  (sometimes  the  definitions  for 
the  rows  and  columns  are  interchanged).  In  general,  the  column  headers  need  not 
correspond  to  the  row  labels.  For  example,  the  rows  could  represent  F-15,  F-16,  MiG-29, 
B-52,  and  Silkwonn.  A  sensor  that  determines  only  the  relative  length  of  such  targets 
might  only  provide  the  following  outputs:  long,  medium,  and  short,  which  would  then 
serve  as  the  column  headers  for  the  sensor’s  confusion  matrix.  However,  for  the  purposes 
of  this  paper,  we  wish  to  consider  the  special  case  of  confusion  matrices  that  have 
column  headers  that  correspond  to  the  row  labels,  for  reasons  that  will  become  clear  later 
on.  Each  row  of  the  confusion  matrix  represents  the  conditional  probability  of  each 
possible  sensor  output  if  the  target  were  truly  of  the  type  defined  by  the  row  label.  An 
ideal  confusion  matrix  of  this  form  would  equal  the  identity  matrix  with  probabilities  of 
one  on  the  diagonal  and  zeros  elsewhere.  A  highly  desirable  (and  potentially  more 
feasible)  confusion  matrix  would  have  its  highest  values  along  the  diagonal  but  would 
inevitably  have  some  probabilities  in  the  off-diagonal  elements.  In  any  case,  the  row 
entries  of  a  probability  matrix  must  add  to  one,  since  the  columns  represent  the  set  of  all 
possible  sensor  outputs  for  the  given  true  target.  In  the  interest  of  improving  the  ratio  of 
the  probabilities  of  correct  to  incorrect  reports  from  an  identification  sensor,  we  will  also 
consider  the  case  of  an  optional  extra  column  that  corresponds  to  an  unknown  or  no 
declaration,  as  shown  in  Figure  2.  In  Figure  2,  P(t2|ti)  represents  the  probability  of 
declaring  target  2  when  target  1  is  actually  being  observed. 

This  paper  addresses  a  method  to  derive  the  conditional  probability  entries  for 
these  confusion  matrices  for  both  one-dimensional  and  two-dimensional  target  ID  sensors 
(and  even  three-dimensional  sensors  if  they  should  exist). 


Sensor  Output 
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H 


2 
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Target  1 

Target  2 

Target  3 

No  Declare 

Target  1 

P(  tj  |  tj ) 

P(  t2  t|  ) 

P(  t3 1  tj  ) 

P(  ND  t, ) 

Target  2 
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P(  t2  t2 ) 
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P(  ND  1 12  ) 

Target  3 

P(  t,  1 13 ) 
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P(  t3 1 13 ) 

P(  ND  1 13  ) 

Figure  2.  Representative  Confusion  Matrix 

Before  discussing  procedures  for  calculating  the  entries  in  a  confusion  matrix, 
consider  the  simple  problem2  posed  below: 

•  Let  X be  a  random  variable  uniformly  distributed  between  0  and  A. 

•  Let  Y  be  an  independent  random  variable  uniformly  distributed  between  0 
and  B. 

•  What  is  the  probability  that  X  >  F? 


From  basic  probability  theory,  the  probability  density  of  X  =  x  is  given  by  fx(x)  = 
1/A.  Similarly,  fy(y)  =  1  !B.  Because  X  and  Y  are  independent,  their  joint  probability 
density  factors,  that  is,  f(x, y)  =  /j(x );/,  (>’)  =  l/AB.  If  A  >  B,  then  the  probability  of  X>  Y  is 
given  by 


BA  ,  B 

P{X  >  Y)  =  J dyj  f(x,y)dx  =  —  J ( A-v)dy 


=  !-A 

2  A 


(1) 


The  second  integral  corresponds  to  summing  the  horizontal  differential  strips 
shown  in  Figure  3.  If  vertical  strips  were  summed  instead,  then  the  following  integral 
would  apply: 


P(  X  >  Y)  = 


max(X) 


min(X) 


min(x,max(y)) 

\fi.x,y)dy 

min(7) 


ax 


(2) 


2  A  reader  well  versed  in  probability  theory  could  skip  to  Chapater  III,  but  the  following  equations  will 
be  applied  in  subsequent  sections  of  the  paper. 
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Figure  3.  Simple  Probability  Example 

Notice  that  as  now  formulated,  the  probability  of  X  >  Y  would  come  from  the 
same  general  equation  (2)  regardless  of  the  domain  of  the  random  variables  x  and  y. 
Moreover,  the  joint  probability  function,/ (x,  y),  need  not  be  unifonn.  That  is,  i  I'/fx,  y)  = 
fn(x,  y)  represents  the  joint  probability  of  a  sensor  providing  either  anXora  7  indication 
when  the  true  target  is  type  ti,  then  (2)  would  generate  the  conditional  probability 
/\Xjti) — the  probability  that  an  X  indication  would  be  provided  when  the  true  target  type 
is  ti.  If  the  identification  sensor  only  has  outputs  X  or  Y,  then  P  (Tjti)  =  1  -  P(X jfi).  If  the 
joint  probability  function  is  defined  over  all  real  numbers,  then  min(X)  =  -oo,  min(T)  = 
-oo,  max(X)  =  oo,  and  min(x,  max(F))  =  x. 

If  the  identification  sensor  has  three  possible  outputs  (X,  Y,  and  Z)  with 
probabilities  defined  over  all  real  numbers,  then  a  similar  process  could  be  used  to  find 
the  probability  that  output  X  occurs  (i.e.,  the  probability  that  the  random  variable  x  has 
the  largest  value)  when  the  true  target  type  is  f,  as  follows: 

.X 

dyjfTi{x,y,z)dz  .  (3) 


/>(x|ti )  =  P(x  >  y  and  x>  z)=  j"  dx  j" 


Equation  (3)  extends  to  any  number  of  output  states  in  an  obvious  way.  If  we 
further  assume  that  the  random  variables  x,  y,  and  z  are  independent,  that  is ,fn(x,  y,  z )  = 
fxi(x)  fniy)  fzi(z),  and  can  each  range  from  -oo  to  +oo,  then: 


P(x|tl)=(x  >y  and  x>z) 

(4) 

=  {  fxi  ( x)FYi  (x)FZi  (x)dx 


j  fxi(x)dx  ^fYi(y)dy  ■  J"  fzi (z)dz 
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where  FY,  and  FZi  are  the  cumulative  distributions  corresponding  to  /«  and  fZi, 
respectively. 

If  independence  among  the  random  variables  associated  with  each  decision  is 
assumed  for  n  possible  sensor  outputs,  labeled  X\  through  X„,  then  for  output  Xp. 

oo  _  Xk 

p(xk  I Ti)=  J  fxSxk)dxk\\  \.fxiw)dw 

-00  j*k 

-00 


00 

=  |  fxt(xk)dxk 
-00 


r ~\FXj{xk) 

j= i _ 


FxSxk) 


(5) 


where  the  density  and  cumulative  functions  correspond  to  a  true  target  type  of  t,. 

Equations  (3)-(5)  all  address  the  case  where  a  sensor  decision  must  be  made. 
There  are  several  ways  to  include  the  possibility  of  “no  declaration.”  For  example,  one 
can  require  the  “winning”  signal  Xk  not  only  to  be  the  largest,  but  to  also  exceed  a 
threshold  value  a. 


Xk  >  Xj  and  Xk  >  a  for  all  k  ±  j.  (6) 

Signals  that  fail  to  satisfy  this  requirement  yield  a  no-declaration  designation.  A 
natural  choice  for  the  threshold  a  is  to  require  signals  less  than  or  equal  to  a  to  capture  a 
prescribed  fraction,  p(a),  of  the  probability  density  function  (PDF)  of  the  true  target, 

a 

P(a)  =  FXt  (a)  =  |  fXk  (x)dx  .  (7) 


Users  requiring  high  confidence  in  making  correct  target  assignments  might  set  p 
to  a  high  value  (say  p  =  0.5);  users  wishing  to  force  decisions,  but  not  requiring  high- 
confidence  decisions,  would  conversely  set  p  to  a  low  value.  The  threshold-conditioned 
confusion  matrix  entries  are  found  by  replacing  the  lower  limits  on  the  integrals  in  (3), 
(4),  and  (5)  by  the  threshold  value  a.  In  the  event  that  the  Xk  variables  are  independent,  it 
can  be  shown  (see  appendix,  Section  A-l)  that  the  no-declaration  probability  is  given  by 
the  simple,  and  generally  obvious,  expression 

n 

P( ND  \Tj)  =  Y[FXj(a).  (8) 

7=1 

Alternatively,  one  can  require  the  winning  signal  to  exceed  all  other  signals  by  a 
designated  amount  s. 
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Xk  >Xj+  s,  for  all  k  ±  j.  (9) 

For  example,  one  could  choose  s  to  be  a  measure  of  the  intrinsic  noise  in  the 
signal  x  (e.g.,  s  equals  one  standard  deviation),  thus  increasing  the  confidence  that  the 
winner  is  not  a  statistical  outlier.  When  imposing  condition  (9),  the  confusion  matrix 
entries  are  found  by  setting  the  upper  limits  of  the  interior  integrals  (all  integrals  except 
for  the  final  integral  over  xk)  in  (3),  (4),  and  (5)  to  x  -  s. 

Finally,  one  could  require  that  the  winner  both  exceed  a  threshold  a  and  exceed 
all  other  signals  by  s,  that  is, 

Xk>Xj+  sandXk>  afor  allk^j.  (10) 

In  this  case  the  confusion  matrix  entries  are  found  by  setting  all  lower  limits  to  a 
and  all  interior  integral  upper  limits  to  x  -  e  in  (3),  (4),  and  (5). 

In  summary  then,  all  we  need  to  know  is  the  joint  distribution  fTi(x,y,  z,...)  or,  if 
independence  applies,  fx(x),  fy(y),  fz(z),...,  to  in  principle  compute  the  conditional 
probabilities  associated  with  the  probability  matrixes,  regardless  of  whether  the 
confusion  matrices  do  or  do  not  include  a  no-declare  column.  A  later  section  will  discuss 
the  exact  conditions  required  for  independence. 
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III.  TARGET  REPRESENTATIONS 


An  HRR  radar  measures  the  radar  return  from  various  range  bins  along  the  length 
of  a  target.  An  RDI  radar  determines  the  returns  from  various  range/cross-range  bins.3 
Figure  4  shows  a  simplistic  representation  of  the  relative  magnitudes  of  the  radar  energy 
returned  by  the  scatterers  associated  with  a  hypothetical  target  for  both  RDI  and  HRR.4 
The  two-dimensional  pattern  represents  the  results  of  the  RDI  measurements.  The  one¬ 
dimensional  pattern  at  the  bottom  represents  the  cross-range  sum  of  the  relative  intensity 
of  all  of  the  scatterers  in  a  given  range  bin.5  The  two-dimensional  pattern  represented  in 
Figure  4  contains  30  x  30  =  900  separate  relative  RCS  measurements  over  a  target.  The 
assumed  RCS  values  for  each  of  these  range/cross-range  bins  are  not  shown,  but  the  16- 
level  color  scale  illustrates  how  the  intensities  vary  across  the  target.  The  rectangular  box 
of  numbers  in  Figure  4  does  show  the  cross-range  sum  of  all  of  the  RCS  values  in  a  range 
bin.  The  color  bar  at  the  bottom  of  the  figure  shows  the  relative  intensities  of  the 
resulting  one-dimensional  profile,  again  on  a  16-level  color  scale.6 

We  wish  to  represent  the  one-dimensional  pattern  of  relative  intensities  as  a 
vector  called  S id.  That  is,  S id  =  (2,  22,  4,...,  2)r,  where  the  superscript  T  indicates  the 
transpose  operation.  In  a  similar  manner,  the  relative  intensities  of  the  900  two- 
dimensional  returns  could  be  recorded  in  a  two-dimensional  table,  or  they  could  just  as 
easily  be  arranged  in  the  fonn  of  a  900-element  one-dimensional  vector.  For  reasons  that 
will  become  apparent  shortly,  we  wish  to  represent  two-dimensional  targets  using  the 
latter  fonn. 


3  An  RDI  radar  directly  measures  target  returns  in  a  polar  pattern;  interpolation  is  then  required  to 
generate  a  return  pattern  on  a  rectangular  grid  [2]. 

4  In  reality,  an  actual  target  will  likely  have  several  scatterers  within  a  given  range  or  range/cross-range 
bin,  so  the  radar  return  received  for  each  radar  bin  will  likely  vary  as  the  viewing  geometry  changes. 

5  A  straight  summation  of  the  radar  cross  section  (RCS)  associated  with  the  various  two-dimensional 
scatterers  is  also  not  totally  correct,  because  there  would  be  varying  phase  cancellation  across  a  one¬ 
dimensional  range  bin  as  the  viewing  angle  changes.  This  assumption  is  made  at  this  point  merely  to 
facilitate  the  initial  description  of  our  approach. 

6  The  white  regions  represent  the  brightest  regions  in  the  two-dimensional  and  one-dimensional  grids; 
however,  these  two-dimensional  and  one-dimensional  white  regions  have  different  RCS  values. 
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Figure  4.  Simple  Target  Representation 

Either  the  one-dimensional  or  two-dimensional  target-characterization  vectors 
could  be  normalized  to  one.  Although  not  strictly  necessary,  we  believe  this 
nonnalization  facilitates  the  analysis  and  helps  in  visualizing  the  computational  process. 
Furthermore,  normalization  of  the  vectors  will  make  the  target  templates  (as  defined  by 
the  one-dimensional  or  two-dimensional  vectors)  independent  of  the  range  between  the 
radar  and  the  target. 

The  foregoing  discussion  introduced  the  idea  of  representing  a  target  as  a  one¬ 
dimensional  or  two-dimensional  pattern  of  relative  intensities  or  RCS  values.  Instead  of 
reporting  the  relative  RCS  (or  energy  or  signal  power)  for  each  of  the  bins,  a  radar  could 
report  the  voltage  associated  with  each  bin.  In  reality,  a  radar  will  receive  a  waveform 
that  contains  both  magnitude  and  phase.  If  the  radar  reports  only  the  magnitude  of  the 
received  signal,  the  elements  of  the  target  report  vectors  consist  of  a  series  of  real, 
positive  numbers.  Alternatively,  a  series  of  complex  numbers  could  be  used  to  represent 
both  the  magnitude  and  phase  of  the  received  signal. 

While  an  actual  radar  would  measure  the  elements  of  the  appropriate  one¬ 
dimensional  or  two-dimensional  vector,  each  component  in  the  vector  would  include 
some  noise,  as  follows: 

Z  =  S  +  N  (11) 

where  Z  is  the  received  signal  recorded  by  the  sensor,  S  is  the  vector  of  complex  voltages 
received  from  the  target  when  the  noise  is  zero,  and  A  is  a  noise  vector  of  complex 
voltages  that  specifies  the  amount  of  noise  added  to  each  component  of  S.  We  will  make 
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the  standard  simplifying  assumption  that  all  of  the  components  of  TV  are  zero-mean, 
normally  distributed  random  variables,  and  that  each  component  has  an  identical  standard 
deviation,  o,  for  both  the  real  and  imaginary  parts  of  each  component.  Hence,  the 
expected  value  of  Z  (which  we  denote  as  <  Z  >)  equals  S,  and  the  covariance  matrix  of  Z 
equals  2  a2/,  where  I  is  the  identity  matrix  with  the  number  of  rows  and  columns  equal  to 
the  number  of  components  in  Z.  In  mathematical  notation: 

<Z>  =  S  and  <  Z  ZH  >-  S  SH  =  2  a2/  =  <  TV  NH  >  (12) 

where  H  indicates  the  Hennitian  adjoint  or  conjugate  transpose,  since  the  elements  of  Z 
and  S  are  assumed  to  be  complex.  The  appendix  (Section  A-2b)  provides  proofs  of  these 
relationships.  The  factor  of  2  appears  in  (12)  because  both  the  real  and  imaginary  parts  of 
each  component  of  TV  are  assumed  to  have  a  standard  deviation  of  cr.  It  is  easily  shown 
(see  Section  A-3  and  below)  that  the  magnitudes  of  the  components  of  the  noise  vector  TV 
have  a  Rayleigh  distribution,  and  the  magnitudes  of  components  of  the  noisy  target 
vector  Z  have  a  Rician  distribution  (see  Section  A-4). 
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IV.  MATHEMATICAL  PROCEDURES  FOR  TARGET 

RECOGNITION 


As  mentioned  before,  the  ideal  confusion  matrix  is  the  identity  matrix  (if  the  “no 
declare”  column  is  eliminated).  Therefore,  one  wants  to  maximize  the  probability  of 
declaring  that  a  target  return  indicates  a  target  of  type  X  when  the  underlying  target  is 
also  of  type  X.  Brennan  and  Reed  [4]  provided  a  matched  filter  fonnulation  that 
maximizes  the  signal-to-noise  ratio  (SNR)  when  the  actual  target  being  observed,  Z 
(where  Z  =  S  +  AO,  and  the  target  template,  S,  agree.  Intuitively,  this  would  seem  to 
produce  maximum  values  along  the  diagonal  of  the  confusion  matrix.7  Brennan  and  Reed 
show  that  the  maximum  SNR  obtainable  for  a  target  occurs  when 

SNR  =  SNRmax  =  SHMlS  (13) 

where  M  is  the  noise  covariance  matrix.  For  our  assumed  noise,  see  (12),  M=  2a2 1.  The 
1  2 

inverse  of  M  is  M  =1/(2  a").  Considering  only  unit  vectors  (i.e.,  \S\  =  1),  then  SNRmax 
=  1  /  (2a-2),  since  SUS  =  1.  For  (13)  to  apply,  the  elements  of  the  complex  vector  S  must 
have  units  of  voltage.  The  perceived  SNR  for  an  actual  observation  vector,  Zx  =  Sx  +  N, 
against  a  test  target,  Sx,  becomes 

SNR  xx  =  Sx'  Ma  Zx  (14) 

where  SNRxx  is  a  random  variable,  because  Zx  is  a  random  variable.  Likewise,  when 
testing  against  template  St,  the  expected  value  of  SNRxt  is 

<  SNRxt  >  =  <  StHM 1  (Sx+  N)  >  =  SrHSx  /  (2a)  (15) 

because  <STH N>  =  STH <N>=  0. 

Figure  5  illustrates  the  application  of  (14)  and  (15).  This  figure  represents  the 
expected  real  and  imaginary  components  of  the  complex  values  SNRxx  and  SNRxt- 
Because  Sx'Sx  -  1,  the  expected  value  <  Sx'Zx  >  has  real  and  imaginary  components  of 
1  and  0,  respectively.  For  a  randomly  chosen  Zx,  SXH Zx  and  SjH Zx  will  typically  lie 
somewhere  away  from  the  expected  positions  of  the  phasors  shown  in  Figure  5, 


7  Alternatively,  one  could  view  the  problem  as  one  of  maximizing  likelihood  ratios  and  similarly 
calculate  a  decision  statistic. 
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depending  upon  the  value  of  cr.  Figure  5  also  illustrates  two  possible  ways  to  determine 
whether  Zx  better  corresponds  to  either  Sx  or  Sj.  In  one  case  we  compare  the  magnitudes 
using 

\SXH  Zx\>\StH  Zx\.  (16a) 

In  the  other  case  we  compare  only  the  real  parts  using 

Refs’/7  Zx)  >  Rc(Srl!Zx).  (16b) 

If  the  inequalities  are  satisfied,  then  one  could  conclude  that  Zx  more  closely 
corresponds  to  ,S’X  than  to  ,S’T.  From  Figure  5,  it  appears  that  (16b)  should  outperform8 
(16a),  although  both  should  do  well  when  the  SNR  is  high,  that  is,  cr  — >  0.  However, 
(16b)  is  sensitive  to  one’s  ability  to  detennine  the  absolute  phase  of  Zx.  By  absolute 
phase  we  mean  determining  the  phase  of  the  first  component  of  the  vector  Zx  correctly 
and  then  determining  the  phase  of  all  other  components  of  Zx  relative  to  this  first 
component.  But  when  the  SNR  is  low  (and  when  the  ability  to  make  accurate 
discriminations  is  most  desired),  accurate  absolute  phase  measurements  become  most 
difficult.  In  fact,  if  every  component  of  the  vector  Zx  is  rotated  180  deg  due  to  an 
absolute  phase  measurement  error,  then  the  real-part-decision  criterion  would  produce  a 
reversed  decision.  On  the  other  hand,  the  magnitude-comparison  criterion  would  be 
unaffected  by  the  absolute  phase-measurement  error.  Therefore,  in  the  remainder  of  this 
paper  we  will  examine  in  detail  the  application  of  magnitude  comparisons  as  defined  by 
(16a). 


Figure  5.  Depiction  of  Expected  Values  of  Test  Statistics  in  the  Complex  Plane  and 
Possible  Comparisons  Between  Them 


As  we  were  finalizing  this  paper,  a  colleague,  Amnon  Dalcher,  pointed  out  that  the  real-part 
comparison  is  theoretically  best  when  the  inherent  target  magnitude  is  known  while  the  absolute-value 
comparison  is  best  when  the  inherent  target  magnitude  is  not  known. 
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The  foregoing  reasoning  gives  the  procedure  for  calculating  the  entries  in  a 
confusion  matrix,  namely: 

•  Assume  a  target  of  type  i  with  template  S,  such  that  Z,  =  5)  +  N.  This  assumes 
that  the  true  target  template  is  known  perfectly.  This  also  assumes  that  SiHSi 
=  1  such  that  <|Z|>  =1.  The  impact  of  these  assumptions  could  be  explored 
by  analyzing  a  number  of  real-world  cases. 

•  Compute  the  inner  (dot)  product  of  Z  with  each  of  the  possible  target 
templates,  Sp  to  form  a  test  statistic  Ty\  =  | Sf  Zj,  where  \Ty |  is  a  random 
variable.  The  test  statistic  involves  a  magnitude,  since  Ty  is  generally  a 
complex  number. 

•  Detennine  the  value  of  each  confusion  matrix  entry  by  evaluating  the 
probability  expression  /’(declare  j  |  target  i)  =  P(\Ty\  >  \Tu |,  \Ti2\,...,  \Tik\,...) 
for  all  k  ±  j. 

Since  the  random  vector  Z,  appears  in  each  of  the  random  variables  Ty,  there  are 
cross  correlations  among  the  Tys.  As  a  result,  one  would  typically  have  to  employ  (3), 
instead  of  the  more  tractable  (4)  or  (5).  The  degree  to  which  the  Tys  (and  hence  the  \Ty\s) 
are  correlated  is  found  from  the  joint  expectation  of  Ty  and  7A  or,  more  generally,  the 
covariance  matrix,  defined  by 

CJk  =  (( v  <  Ty  >)\Tik-<  Tik  >)}  .  (17) 

In  the  appendix  [see  Section  A-5,  (A26)]  it  is  shown  that  the  covariance  matrix  is 
given  by  Cjk  =  2a2A///A/.9  This  implies  there  will  be  little  correlation  of  the  test  statistics 
if  the  underlying  target  templates  differ  significantly  (i.e.,  if  \SkHSy\  «  1 ,  except  when  j  = 
k).  The  more  orthogonal  the  target  templates,  the  less  risk  one  has  in  employing  (4)  or 
(5) — equations  that  assume  statistical  independence  among  all  the  test  statistics. 


9  Note  that  the  covariance  matrix  of  the  complex  values  Ty  does  not  depend  on  the  true  target  St.  We 
later  show,  however,  that  the  covariance  matrix  of  the  magnitudes  \Ttj\  does  depend  on  the  true  target. 
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V.  EXAMPLE  CASES 


In  this  section,  we  present  several  example  cases  that  illustrate  the  theoretical 
development  outlined  above.  We  begin  with  the  coherent  processing  case  even  though  in 
practice  noncoherent  processing  is  generally  more  predictable.  The  coherent  case, 
especially  when  the  targets  are  not  correlated,  yields  a  rather  simple  formulation.  It  also 
illustrates  possible  upper  bounds,  assuming  one  is  able  to  measure  the  relative  phase 
among  the  components  of  a  target  template.  Our  use  of  (16a),  however,  means  that  our 
coherent  processing  results  do  not  require  an  absolute  measurement  of  target  phase;  that 
is,  our  coherent  process  is  insensitive  to  an  absolute  phase  rotation  of  all  elements  in  the 
measurement  vector  as  might  occur  at  low  SNR  when  detennining  phase  becomes 
especially  difficult. 

A.  UNCORRELATED  TARGETS— COHERENT  PROCESSING 

This  case  treats  the  ideal  situation  where  all  pairs  of  target  templates  have  zero 
inner  product  (no  overlap).  Although  this  condition  is  not  likely  to  occur  in  practice,  we 
believe  it  is  worthwhile  to  examine  this  case  because  the  no-overlap  condition  could  be 
approximately  met  in  some  cases.  The  independent  target  case  also  establishes  the  “best 
case”  confusion  matrix  for  a  set  of  targets.10  Let  there  be  n  mutually  orthogonal  targets 
denoted  by  S)  ,  i  =  1,  2,...,  n.  As  in  Chapter  IV  above,  the  test  signal  associated  with 
target  i  is  assumed  to  be  Z,  =  5)  +  N,  where  A  is  a  complex,  zero  mean,  Gaussian  noise 
vector.  Consider  a  given  row  i  of  the  confusion  matrix;  this  is  the  row  associated  with  the 
true  target  St.  There  are  then  n  complex  random  variables  of  interest,  given  by  T,k  =  S^Z,, 
k  =  1,  2 ,...,«.  For  notational  clarity,  in  the  remainder  of  this  example  we  will  suppress 
the  true  target  subscript  i,  that  is,  7)/->7).  Because  the  targets  are  orthogonal,  the 
covariance  matrix  is  diagonal,  that  is 

Cjk  =  ((Tj -  <  Tj  >)*(Tk -  <  Tk  >)}  =  2ct25 jk  ( 1 8) 


10  By  best  case,  we  mean  the  highest  probability  of  correct  target  recognition  for  a  given  SNR. 
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and  the  random  variables  Tk  are  independent.  Because  the  Tk  are  independent,  their 
magnitudes,  Rk  =  \Tk\,  are  also  independent,  with  joint  expectation  [see  Section  A-4, 
(A27)]  given  by 

<RjRk>=  (1  +  2cr2  )5 jk .  (19) 

Accordingly,  the  joint  density  of  the  Rks  factors, 

Wi  A2 ....  ,Rn )  -  A  (Ri  )fi  (Ri  )■■■/,  (*„ )  •  (20) 


All  that  is  needed  are  expressions  for  the  individual  densities  fk  on  the  right-hand 
side  of  (20).  In  the  appendix  (Section  A-5)  it  is  shown  that  for  k  =  i  (if  the  template 
matches  the  true  target),  fk  is  a  Rician  probability  density;  otherwise,  fk  is  a  Rayleigh 
probability  density, 


fk(Rk) 


(V+i) 

e  k  =  i 


(21) 


By  applying  (5),  the  confusion  matrix  entries  for  a  given  row  can  then  be 
computed.  We  first  recognize  that  neither  PDF  in  (21)  contains  any  parameters  of  the  true 
target  S, — only  the  noise  parameter  cr  is  present.  It  follows  that  one  need  only  to  compute 
the  matrix  elements  for  a  single  row.  Further,  from  the  structure  of  (5)  it  is  clear  that 
there  are  just  two  unique  matrix  elements:  all  k  =  i  (diagonal)  elements  are  the  same,  and 
all  k^  i  (off-diagonal)  elements  are  the  same.11  This  makes  sense  intuitively  because  the 
mutual  orthogonality  of  the  targets  prevents  information  from  target  i  being  gained  by 
comparing  it  (dotting  it)  with  target  k.  Finally,  since  the  rows  of  the  confusion  matrix 
sum  to  one,  and  the  off-diagonal  elements  are  the  same,  the  off-diagonal  elements  can  be 
computed  once  the  diagonal  element  is  known.  Therefore,  there  is  just  one  integral  to 
compute,  the  diagonal  matrix  element,  given  by 


OO  («2+l) 

(R  \ 

\Re  2-2  /0(A-) 

J  ue  2tJ~  du 

0 

A 

(22a) 


11  This  assumes  square  confusion  matrices  that  do  not  contain  a  no-declare  column.  If  a  no-declare 
column  were  included,  the  limits  in  the  ensuing  integrals  would  have  to  be  adjusted  accordingly.  Also, 
it  assumes  that  the  os  are  all  the  same  for  each  true  target  type,  which  may  not  always  be  a  reasonable 
assumption,  as  discussed  in  Chapter  VI. 
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=  V  (-!/(»-!)!  e~  2o\k+D  (22b) 

f^(k  +  mn-k-l)\ 

which  follows  from  (22a)  by  expanding  the  result  of  the  interior  integral  as  a  binomial 
and  applying  the  integral 

oo  2 

j  Re-aR"  Jn(fiR)dR  =  ^  (22c) 

0 

to  the  result  [5].  The  off-diagonal  elements  are  given  by  the  simple  expression 

<23) 

n  -1 

This  yields  the  surprising  result  that  the  independent-target,  coherent-processing 
case  gives  rise  to  a  universal  confusion  matrix,  whose  elements  are  independent  of  the 
details  of  the  individual  targets. 

Figure  6  shows  a  plot  of  the  diagonal  matrix  element,  (22),  as  a  function  of  the 
SNR:  SNR  =  l/(2a2),  for  n  =  2,  5,  10,  and  20  targets.  As  the  SNR  approaches  zero,  P(\<ag 
approaches  1  /n;  in  this  limit,  the  off-diagonal  elements  (23)  equal  the  diagonal  element, 
and  there  is  equal  probability  of  identifying  any  one  of  the  targets  as  the  true  target.  As 
the  SNR  increases,  Rdiag  approaches  one;  correspondingly,  the  off-diagonal  elements  all 
approach  zero. 


Independent  Targets:  Coherent  Processing 


Figure  6.  Diagonal  Matrix  Element  as  a  Function  of  SNR  for  Independent  Targets 
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B.  CORRELATED  TARGETS— COHERENT  PROCESSING 


In  practice,  there  will  always  be  overlap  between  targets.12  For  each  true  target  Si, 
one  is  then  confronted  with  a  set  of  n  correlated  complex  random  variables  T,/(  =  S/nZ„  k 
=  1,  2,...,  n  with  covariance  matrix  Cjk  =  2a  Sk  Sj  and  the  corresponding  correlated 
magnitudes  R&  =  \Ty\.  Here  and  throughout  this  section  the  true  target  subscript  i  is 
retained.  To  compute  confusion-matrix  entries,  one  must  use  the  general  probability 
relation  given  in  (3);  however,  a  prerequisite  is  the  joint  PDF  of  the  magnitudes,  f(Rn, 
Ra,...,  Ri„).  We  show  in  the  appendix  [Section  A-5,  (A33)]  that  the  PDF  for  each 
amplitude  Rn,  is  given  by  the  Rician  density 

( Rik~+cik~ ) 

./;,(%}  =  ^  I0(^)  (24) 

a  a 

H 

where  c*=|  Sk  Sj  |.  To  construct  the  exact  joint  PDF  of  the  correlated  Rician  random 
variables,  the  second-order  correlations,  <RlJ  R,k>,  and  all  higher  order  correlations  are 
needed.  Unfortunately,  to  our  knowledge,  neither  these  correlations,  nor  the  joint  PDF 
itself,  can  be  computed  in  closed  form  [6]. 


In  spite  of  this,  an  approximate  joint  PDF  can  be  constructed  in  the  high  signal-to- 
noise  limit  (i.e.,  when  o/c,i  «1).  In  this  limit,  the  Rician  PDFs  in  (24)  are  well 
approximated  by  Gaussian  PDFs  (see  Section  A-4  for  justification),  and  the  joint  PDF  is 
well  approximated  by  a  multidimensional  Gaussian  of  the  form  [7] 

M*i)”  i  1 

det(AT, )  (25) 

exp(-y(R,  -  fij)7 K^iRj  -//,  )) 


where  R  =  (Rn,Ra,...,Rin)T ,  //,  =  (R;) ,  and  K,  is  the  covariance  matrix  of  the  amplitudes. 

The  components  of  the  mean  vector  can  be  computed  exactly  from  the  mean  of  a  Rician 
random  variable  (see  Section  A-4): 


<  Rik 


(2  +  ^)/0(^)  +  ^A(^) 


(26) 


where  7o  and  I\  are  the  zeroth  and  first-order  modified  Bessel  functions  of  the  first  kind. 
The  elements  of  the  covariance  matrix  for  true  target  St  are  given  by 

Kjk(  0  s  {(Ry  ~  <  Ry  >)(Rik  ~  <  Rik  >))  =<  RijRik  >~<Rij  ><  Rik  >  ■  (27) 
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A  judicious  choice  of  target  vectors  could  conceivably  help  reduce  the  overlap  between  targets, 
however.  This  topic  is  taken  up  in  Chapter  VI. 
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The  diagonal  (j  =  k )  elements  of  the  joint  expectation  of  Ry  and  Rlk  can  be 
computed  exactly  [see  (A21)]: 

<  Rfj  >=  Cy  +  2a2 .  (28) 

Although  it  is  not  possible,  to  our  knowledge,  to  compute  the  off-diagonal 
elements  of  the  joint  expectation  of  Ry  and  Rlk  exactly,  a  lengthy  but  straightforward 
calculation  (see  Section  A-5)  shows  that  to  order  cr,  the  joint  expectation  is  given  by 

<  RijRik  >~CijCik  +  ~~~ —  {4  +  C2k  +  2 Cijcikckj  cos (<fij  -  <j>ik  -  4 ))  (29) 

Zcijcik 

where  the  phases  are  defined  though  the  relations  SHSt i=cije"^iJ ,  SnS,  =  cjke^ik ,  and 
SHSk  =  ck)e'ki  .13  With  the  mean  and  covariance  in  hand,  the  confusion  matrix  entries  for 

high  signal  to  noise  can  be  estimated  by  numerically  integrating  (3),  using  the  joint  PDF 
(25),  repeated  here  using  the  notation  of  this  section: 

f  \ 

CO  „  Rij 

P(Sj  \s,)=  | dRy  n  I dRik  MR, )  •  (30) 

—00  k=l  —co 

\k*J 

We  give  here  an  example  with  the  simplest  possible  case  of  two  targets,  ,Sj  and  W. 
This  case  leads  to  several  simplifications.  First,  there  is  only  a  single  off-diagonal 
element  in  the  covariance  matrix  (27).  Further,  the  joint  expectation  (29),  and  therefore 
the  covariance  matrix  (27),  does  not  depend  on  the  phases.  For  true  target  S 1  (/  =  1)  (29), 
reduces  to 

<RnRn  >=c12+^(l  +  3cf2)  (31) 

Zc\2 

again  valid  to  order  <j.  An  expression  for  the  mean  of  a  Rician  in  the  high  SNR  limit  is 
derived  in  the  appendix.  Applying  this  expression  to  R\  \  and  R\i  yields 

<^ii>=1  +  ^-  (32a) 

<Rn>=cn+-^—.  (32b) 

2c[2 

By  combining  (27),  (28),  (31),  and  (32)  and  simplifying,  we  arrive  at  a  simple 
approximate  expression  for  the  covariance  matrix  associated  with  true  target  5): 

13  Note  that  for  j=  k  (diagonal  elements),  expression  (29)  is  exact  and  reduces  to  the  result  in  (28). 
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(33) 


*i  = 


^  c12<7  (  1-^r) 


4q2 


It  is  evident  that  the  covariance  matrix,  K2,  associated  with  target  S2,  is  obtained 
by  interchanging  the  diagonal  elements  of  K\.  Because  of  the  symmetry  of  the  two-target 
case,  the  diagonal  elements  of  the  confusion  matrix  are  identical,  as  are  the  off-diagonal 
elements.  Within  the  Gaussian  approximation,  there  are  then  just  two  parameters  that 
define  the  two-target  case — the  magnitude  of  the  target  correlation,  c\2=\  S2HS 1  |,  and  the 
noise  strength  cr.14 

Figure  7  shows  the  diagonal  element  of  the  two-target  case,  that  is,  the  probability 
of  correct  target  identification,  computed  from  the  Gaussian  approximation  (30),  as  a 
function  of  SNR  =  l/(2cr)  for  four  values  of  the  target  correlation:  cn  =  0.3,  0.5,  0.7,  and 
0.9.  For  comparison,  the  independent  target  result,  (22),  is  also  shown.  Also  plotted  are 
simulated  results  for  two  two-dimensional  complex  templates  with  correlations  0.3,  0.5, 
0.7,  and  0.9. 15  Simulations  were  performed  by  sampling  the  signal  vector  100  000  times, 
projecting  each  onto  the  2  templates  vectors,  and  computing  the  fraction  of  cases  when 
one  projection  exceeds  the  other.  The  agreement  between  the  Gaussian  approximation 
and  simulated  results  is  seen  to  improve  with  increasing  SNR  and  for  increasing 
correlation.  Because  the  Gaussian  approximation  holds  only  for  high  SNR,  the  Gaussian 
approximation  curves  in  Figure  7  are  limited  to  SNR  values  that  correspond  to  the 
condition  dc\2  <  1.5.  It  is  apparent  from  the  figure  that  the  probability  of  correct  target 
ID  diminishes  with  increasing  target  correlation  and  that  a  high  degree  of  target 
correlation  (e.g.,  c\2  =  0.9)  degrades  the  probability  significantly  from  the  independent 
target  result  (roughly  20  percent  for  an  SNR  =  10). 


14 

15 


It  can  be  shown  that  the  two-target  case  depends  only  on  Cn  and  a  for  the  entire  range  of  SNR  values. 


The  template  vectors  were  taken  to  be  Sl  =-jj 


1 

vly 


and  S2  = 


a 


JT- 


2  /e 
a  e 


.  Overlap  values  of  0.3,  0.5, 

y  V  1  J 

0.7,  and  0.9  can  be  obtained  with  parameter  pairs  (a,  9)  =  (0.4756,  2.9413),  (0.2645,  2.9413),  (0.0102, 
2.9413),  and  (0.7071,  0.9021),  respectively. 
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Figure  7.  Diagonal  Matrix  Element  as  a  Function  of  SNR  for  Two  Correlated  Targets 

C.  CORRELATED  TARGETS— NONCOHERENT  PROCESSING 

An  alternative  approach  to  the  coherent  processing  case  is  to  represent  all  target 
templates  as  real  vectors  by  retaining  only  the  magnitudes  of  the  components  of  the 
nonnalized  complex  templates  A,  above.  The  real  template  vectors  are  denoted  by 

A  (b/l >A'2.’”.Am )  (1^/1  U  si2  lv,l  $im  |)  , 

i=l,2,...,n,  (34) 

where  m  is  the  number  of  components  of  St.  Since  the  S,  are  nonnalized  according  to 
SjHS,= 1,  the  Aj  are  automatically  normalized  according  to  A[A  j=\ . 

Similarly,  signal  vectors  are  represented  by  vectors  whose  components  are  the 
magnitudes  of  the  components  of  the  complex  vector  Z,  =  S,  +  N  .  Hence,  signal  vectors 
are  represented  by 

¥i  =(AiAi2v,Jim)7' 

=  (k-i  +n\  1.1 -2  +«2  lv-,k™  +nm  |)r 

i=  1,2  (35) 

In  radar  parlance,  this  is  known  as  noncoherent  processing.  As  discussed 
previously,  each  component  of  T,  is  a  Rician  random  variable  with  PDF 

/  2  ,  2x 

(>'«  +°,-t ) 

fik{yik)  =  ^e  *  70(^)  .  (36) 
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To  compare  template  Ak  with  true  target  F,-,  we  form  the  inner  product16 

m 

wik  =  AkTyi  =  •  (37) 

a=l 

For  each  true  target,  we  are  again  confronted  with  a  set  of  n  correlated  random 
variables  wik,  k  =  1,  2,...,  n.  Again,  the  joint  PDF  is  desired,  but  it  is  impossible  to 
compute  in  closed  form.  An  approximate  method  for  obtaining  the  joint  PDF  is  presented 
in  the  following.  Since  each  component  of  F  is  a  Rician17  random  variable,  wa  is  a 
strictly  positive  weighted  sum  of  Rician  random  variables.  By  the  central-limit  theorem, 
as  the  number  of  components,  in,  of  F,  increases,  the  density  function  of  vv,/,  approaches  a 
Gaussian.  Similar  to  the  coherent  case  above,  we  again  seek  a  joint  PDF  in  terms  of  a 
multidimensional  Gaussian 

fi(wi)  -  -j=  1  •  exp  (-{(tv,-  -  G  fgr  V/  -  £■))  (38) 

VGA)"  det(a ) 

where  wi={wa,wa,...,win)T ,  £ j=<wi> ,  and  Qi  is  the  covariance  matrix  for  the  random 
variables  w*.  The  mean  vector  Q  can  be  computed  exactly  with  the  help  of  (26).  In  the 
appendix  [Section  A-6,  (A55)]  we  show  that  the  covariance  matrix  Qt  can  be  computed 
exactly: 

m 

Qjk(i )  =<(wy-  <  ™ij  >)(wik-<  Wik  >)>=Yja jadfattka  (39) 

a= 1 

where  a;a  =<  yfa  >  -  <  yia  >2  is  the  variance  of  the  Rician  random  variable  yia,  an 
expression  for  which  is  given  in  the  appendix  (Section  A-4).  Finally,  the  confusion 
matrix  entries  can  be  found  by  employing  the  general  probability  expression  in  (3).  Note 
that  each  true  target  At  (zth  row)  is  associated  with  a  distinct  joint  density  f.  We  proceed 
with  a  numerical  example  to  illustrate  the  method. 

Figure  8  shows  images  of  three  target  templates,  fashioned  to  resemble  simple 
plan  views  of  three  (admittedly  bizarre)  aircraft.  The  targets  are  represented  by  4  x  5 
matrices,  given  by 


16  We  will  first  follow  the  same  recipe  as  just  developed  for  the  coherent  case.  However,  we  will  then 
demonstrate  that  some  difficulties  can  arise  at  low  SNR  values.  These  difficulties  ultimately  lead  to  an 
alternative  formulation  that  eliminates  the  low-SNR  problems. 

17  Some  of  the  terms  in  F  will  correspond  to  Rayleigh  random  variables,  but  the  Rayleigh  is  merely  a 
special  case  of  the  Rician. 
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Figure  8.  Three  Hypothetical  Templates  Represented  by  Matrices  in  (40) 

The  lighter  shading  in  the  figure  represents  the  larger  matrix  entries.  Each 
template  can  be  recast  as  a  20-dimensional  vector.  The  matrix  of  inner  products  between 
all  templates  is  then  given  by 


'  Al  Ax 

A\  A2 

r  1 

0.8 

0.149' 

AlA\ 

a\a2 

A2A3 

= 

0.8 

1 

0.0745 

vAr  A\ 

A3A2 

A3A3j 

0.149 

0.0745 

1  , 

Consistent  with  Figure  8,  there  is  significant  overlap  between  targets  1  and  2,  less 
overlap  between  targets  1  and  3,  and  very  little  overlap  between  targets  2  and  3. 

We  first  examine  pairwise  target  comparisons,  starting  with  targets  1  and  2  in 
Figure  8.  Figure  9  shows  a  plot  of  the  matrix  element  Pu  of  the  2x2  confusion  matrix  as 
a  function  of  SNR  =  l/(2cr),  both  with  the  Gaussian  approximation  in  (38)  (solid)  and 
with  direct  simulation  (open  circles).  Simulations  were  perfonned  by  sampling  the  signal 
vector,  with^i  as  the  true  target,  100  000  times,  projecting  each  onto  the  two  templates 
vectors  A\  and  A2,  and  computing  the  fraction  of  cases  when  one  projection  exceeds  the 
other.  The  good  agreement  between  the  Gaussian  approximation  and  simulated  results 
justifies  the  use  of  the  Gaussian  approximation  for  these  20-dimensional  target  templates. 
The  matrix  element  P22  exhibits  essentially  identical  behavior;  good  agreement  between 
the  Gaussian  approximation  and  the  simulated  results  is  also  observed.  Shown  for 
comparison  in  Figure  9  is  the  simulated  two-target  coherent  case  with  an  overlap  of  0.8. 
As  expected,  the  coherent  result  outperforms  the  noncoherent  result,  particularly  for  low 
SNRs. 
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Targets  1  and  2:  Overlap=0.8,  Noncoherent  Processing 


Figure  9.  Pu  vs.  SNR  for  Pairwise  Comparison  of  Targets  1  and  2  in  Figure  8 

Figure  10  shows  the  confusion  matrix  elements  P\ \  and  P33  as  a  function  of  SNR 
for  the  pairwise  comparison  of  targets  1  and  3  shown  in  Figure  8.  Displayed  in  the  plot 
are  the  confusion  matrix  elements  computed  with  the  Gaussian  approximation  and  with 
direct  simulation,  as  in  the  creation  of  Figure  9.  Again,  the  Gaussian  approximation  is 
seen  to  agree  well  with  the  simulated  results.  Also  shown  is  the  simulated  two-target 
coherent  case  for  an  overlap  of  0.15.  Unfortunately,  the  noncoherent  results  in  Figure  10 
make  no  sense.  At  very  low  SNR  values,  one  would  expect  that  targets  1  and  3  would  be 
indistinguishable  and  each  would  be  selected  with  a  50-percent  probability  when 
choosing  only  between  these  two  targets. 

The  explanation  for  this  behavior  is  as  follows.  It  is  shown  in  the  appendix  (A  19) 
that  for  small  A/ <j  (low  SNR),  the  mean  of  a  Rician  approaches  crjf  (this  is  the  mean  of 
a  Rayleigh  random  variable).  Therefore,  the  expected  measured-signal  vector  approaches 
<r^f  (1,1,...,  I)7.  It  follows  from  the  definition  of  wu t  in  (37)  that  the  template  A/,,  whose 
components  sum  to  the  largest  value,  will  yield  the  largest  projection  (alternatively,  the 

template  that  “looks”  most  like  the  (1,1,1, _ ,1)^  vector  “wins”).  Consequently,  the 

probabilities  associated  with  this  template  will  be  skewed  to  larger  values.  In  the  case  of 
targets  1  and  3,  the  component  sums  are  VTo  =  3.16  and  lo/Vl8  =  2.36.  This  explains  why, 
in  Figure  10,  as  SNR  decreases,  P\\  is  abnormally  large  and  P33  is  abnormally  small.  In 
each  case  the  projection  onto  target  1  is  consistently  larger  than  the  projection  onto  target 
3.  We  emphasize  that  this  effect  is  a  natural  consequence  of  applying  the  projection 
methodology  to  noncoherent  processing — it  is  not  an  artifact  of  the  Gaussian 
approximation  technique  outlined  above.  Similar  results  are  observed  when  targets  2  and 
3  are  compared,  with  target  2  “winning”  as  the  SNR  decreases  (the  component  sum  for 
target  2  is  the  same  as  for  target  1). 
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Targets  1  and  3:  Overlap=  0.15,  Noncoherent  Processing 


Figure  10.  Pn  and  P33  vs.  SNR  Pairwise  Comparison  of  Targets  1  and  3  in  Figure  8 
Demonstrates  a  Flaw  in  our  Direct  Use  of  a  Linear  Process 


Although  the  foregoing  discussion  explains  why  the  bizarre  result  depicted  in 
Figure  10  occurs,  this  limitation  seems  rather  unsatisfying.  Therefore,  upon  further 
reflection,  we  realized  that  the  problem  actually  lay  in  our  representation  of  the  target 
templates.18  We  really  need  to  define  target  templates  that  represent  the  expected  return 
when  observing  a  target.19  Hence  the  target  templates  for  the  noncoherent  case  need  to  be 
defined  as  follows: 

A  i  =  (<  (s,!  +  »,)>,<  | sj2  +  n2  >,...,<  | sjm  +«,„|>)r-  (42) 


The  terms  in  (42)  are  the  means  of  Rician  processes  such  that  <|sjk  +  nk\>  =  f(.Sjk, 
«k)  >.Sjk,  where  fi.Sjk,  /7k)  indicates  that  the  mean  of  a  Rician  process  is  a  function  of  both 
the  signal  Sjk  and  the  noise  n\.  For  large  SNR,  <|sjk  +  «k|>  — >% ,  and  for  small  SNR,  <|.Sjk 
+  /?k|>  =  •  Obviously,  this  means  that  the  target  templates  must  vary  with  SNR.  The 


appendix  (Section  A-4)  shows  how  to  calculate  exactly  the  means  of  the  Rician  random 
variables.  While  we  have  used  this  exact  calculation  in  developing  the  ensuing 
illustration,  a  reasonable  numerical  approximation  we  have  empirically  derived  (i.e., 
within  ~  1  percent)  is  as  follows: 


<  \S  ,,  +  nt 


(s>  )21S  + 


(43) 


18  An  analogous  issue  occurs  in  noncoherent  detection  theory,  which  uses  a  square  law,  as  opposed  to  a 
linear,  detector  when  the  SNR  is  low. 

19  Without  directly  thinking  about  it  this  way,  that  is  exactly  what  occurs  in  the  coherent  case  because  the 
noise  is  zero-mean  Gaussian. 
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Based  on  these  considerations,  the  target  1  and  3  templates  for  SNR  values  of  1, 
4,  and  10  are  shown  in  Table  I. 

Table  I.  SNR-Dependent  Noncoherent  Target  Templates 


Target  1  Templates  Target  3  Templates 


SNR  =  1 


0.886  0.930  0.930  0.930  0.886 
0.886  0.886  0.930  0.886  0.886 
0.930  0.930  0.930  0.930  0.930 
0.886  0.886  0.930  0.886  0.886 


0.886  0.886  0.886  0.886  0.886 
0.982  0.982  0.911  0.982  0.982 
0.886  0.886  0.911  0.886  0.886 
0.886  0.886  0.886  0.886  0.886 


SNR  =  4 


0.443  0.528  0.528  0.528  0.443 
0.443  0.443  0.528  0.443  0.443 
0.528  0.528  0.528  0.528  0.528 
0.443  0.443  0.528  0.443  0.443 


0.443  0.443  0.443  0.443  0.443 
0.621  0.621  0.491  0.621  0.621 
0.443  0.443  0.491  0.443  0.443 
0.443  0.443  0.443  0.443  0.443 


SNR  =  10 


0.280  0.405  0.405  0.405  0.280 
0.280  0.280  0.405  0.280  0.280 
0.405  0.405  0.405  0.405  0.405 
0.280  0.280  0.405  0.280  0.280 


0.280  0.280  0.280  0.280  0.280 

0.529  0.529  0.353  0.529  0.529 

0.280  0.280  0.353  0.280  0.280 

0.280  0.280  0.280  0.280  0.280 
v-  -J 


When  processing  variable  target  templates  such  as  these  using  (37),  one  obtains 
the  results  shown  in  Figure  1 1.  As  before,  the  simulated  results  come  from  averaging  the 
results  of  100  000  trials  at  each  SNR  value. 

Using  this  revised  template  definition  process,  Figure  1 1  shows  that  the  decision 
between  targets  1  and  3  becomes  50-50  at  low  SNR  values.  Also,  the  results  parallel,  but 
fall  below,  the  curve  obtained  for  coherent  processing  with  a  target  template  correlation 
of  0.15.  Although  the  above  method  will  work  when  developing  confusion-matrix 
probabilities  for  noncoherent  processing,  the  practical  implementation  of  these  SNR- 
dependent  templates  will  necessitate  estimating  the  noise.  This  could  be  accomplished  by 
averaging  the  returns  for  elements  of  the  original  template  where  the  RCS  is  zero  (e.g., 
elements  outside  the  extent  of  the  target).  In  these  regions,  the  Rayleigh  distribution 
applies,  and  the  value  of  the  noise  parameter  can  be  observed  directly. 
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Targets  1  and  3:  Ovcrlap=  0.15,  Noncoherent  Processing,  Mean  Template 


Figure  11.  Corrected  Results  of  Pn  and  P33  vs.  SNR  with  Revised  Templates  for 

Targets  1  and  3 
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VI.  PRACTICAL  CONSIDERATIONS 


This  section  discusses  a  few  practical  issues  associated  with  implementing  the 
computations  discussed  above. 


A.  UNCERTAINTIES  ASSOCIATED  WITH  COHERENT  PROCESSING 


Section  V  began  with  a  discussion  of  a  target-processing  scheme  that  involved 
target  template  and  measurement  vectors  that  contain  complex  terms  to  represent  the 
relative  phase  between  positions  in  the  vectors.  Although  magnitude  processing  is 
insensitive  to  absolute  phase-measurement  errors,  it  still  requires  extremely  precise 
knowledge  of  the  viewing  angle  of  the  target,  as  well  as  significant  uniformity  among  the 
various  actual  targets  being  represented  by  a  target  template  vector.  Neither  of  these 
conditions  will  consistently  occur  in  practice.  For  example,  if  one  considers  the  two- 
scatterer  situation  depicted  in  Figure  12,  then  the  relative  phase  between  a  return  from 
each  of  the  individual  scatters  varies  as  follows: 


,  2  Ad  2  ©IV  A.  (l -cos©) 

AO  =  to  A t  =  ro - = - - - 

c  aX/2n 

=  4jiV(1-cos@) 


c/AO 

d& 


=  4ttN sin© . 


(44) 

(45) 


Distance  between  scatterers  =  NX 


r  "s 


Figure  12.  Longitudinal  Phase  Difference  Geometry 

For  example,  for  two  scatters  separated  by  1  meter  at  X  band  (1010  Hz),  N  = 
100/3,  so  the  variation  in  O  as  0  varies  approximately  equals  420  0  when  the  angle  0  is 
small.  Hence,  even  a  small  uncertainty  in  the  viewing  angle  0  results  in  a  large  change  in 
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the  relative  phase,  even  though  the  apparent  difference  in  the  relative  positions  of  the 
scatters  would  be  imperceptible. 


B.  TARGET  SCINTILLATION  ELLECTS 


If  a  range  bin  contains  multiple  scatterers,  then  the  received  HRR  signal  will  vary 
as  the  viewing  angle  changes.  Figure  13  shows  the  simplest  case  of  only  two  scatterers  in 
the  same  range  bin  separated  by  a  distance  of  nk.  With  respect  to  the  centroid  of  the  two 
scatterers,  the  relative  phase  from  the  individual  scatterers  is  given  by 


2  Ad  „ 

A<£  =  co  At  =  co - -2  co 


[nA/  2)sin© 
coA!2n 


-  2 n  n  sin  0  *  2k  n  0 


(46) 


so  the  received  signal  becomes 

sin  (cot  +  AO)  +  sin  (cot  -  AO)  =  [2cos(2/r/?©)]sinfttf  .  (47) 


Figure  13.  Scintillation  Geometry 

At  X  band  for  this  simple  case  of  two  scatterers  in  the  same  range  bin  separated 
vertically  by  1  meter,  the  amplitude  of  the  received  signal  varies  with  angle  as  shown  in 
Figure  14.  The  amplitude  of  the  sine  term  in  (47)  depends  greatly  on  the  angle  0. 
However,  if  two  range  bins  separated  longitudinally  by  an  arbitrary  distance  happen  to 
both  have  two  scatterers  separated  by  1  meter,  then  the  relative  difference  in  magnitude 
between  these  two  range  bins  would  not  vary  with  the  angle  0.  Of  course,  as  the 
amplitudes  approach  zero  (i.e.,  [2  cos(2  n  n  0)]  -»  0),  one’s  ability  to  recognize  that  the 
returns  are  equal  will  disappear  as  noise  becomes  comparable  in  magnitude  to  the  signal. 

C.  EFFECT  OF  TOTAL  TARGET  RETURN  ON  NOISE  VALUES 

The  above  analyses  assumed  unit  target  template  vectors  (i.e.,  SxHSx  =  1).  This 
was  done  so  the  relative  angles  between  the  template  vectors  would  provide  a  direct 
measure  of  the  degree  of  independence  between  the  individual  target  templates. 
However,  this  also  means  that  all  the  target  templates  used  in  building  the  confusion 
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Angle,  ©,  from  Parallel  to  Centerline  (degrees) 


Figure  14.  Variation  in  Magnitude  of  Return  as  Angie  Changes 


matrices  assume  that  all  targets  are  equally  bright  (or  dim).  This  effect  matters  most  when 
thinking  about  the  various  rows  in  the  confusion  matrix  that  correspond  to  a  priori 
assumptions  that  each  of  the  various  targets  is  being  observed.  In  reality,  there  can  be 
substantial  variability  in  the  total  brightness  of  targets  (e.g.,  transport  aircraft  vs.  cruise 
missiles).  When  making  a  target  observation,  we  will  observe  Zx,  where  Zx  =  Tx+  N. 
While  we  can  compute  |  Zx  |  directly,  we  would  really  like  to  detennine  \TX\.  However, 
if  we  can  estimate  the  amount  of  noise  (which  we  might  do  by  observing  returns  in 
components  of  ZX  without  any  target  signal  present),  then  we  could  estimate  |  Tx  |.  With 
this  we  could  then  form 


7' 

^  v 


Tx+N 


■  Sx  + 


N 


(48) 


where  Sx  is  now  the  unit  target  template.  From  (48)  we  see  that  the  noise  portion  of  Z'x 
varies  with  the  brightness  of  the  target.  As  the  magnitude  of  the  noise  varies,  the  entries 
in  the  confusion  matrices  vary  (e.g.,  see  Figure  7  or  11).  The  details  of  how  to  address 
this  effect  are  outside  the  scope  of  the  material  we  wish  to  address  in  this  paper. 


D.  EVALUATION  OF  TARGET  CLASSES  AND  TARGET  FLUCTUATIONS 

The  above  formulations  have  considered  that  each  target  will  have  a  unique  target 
template.  However,  situations  might  arise  such  as  the  one  depicted  in  Figure  15,  in  which 
groups  of  targets  appear  highly  correlated,  but  the  correlation  between  the  target  groups 
is  small.  For  example,  one  target  group  could  involve  two-engine  fighter  aircraft  (e.g.,  F- 
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15,  F-18,  Mig-29)  while  another  could  involve  two-engine  commercial  aircraft  (e.g.,  B- 
757,  B-767,  A-310).  In  this  case,  one  could  fonn  a  composite  template  for  each  of  the 
target  groups  and  then  compare  measured  targets  against  each  of  the  groups.  A  simple 
way  to  do  this  would  be  to  assume  increased  noise  associated  with  the  target 
measurements  so  as  to  include  both  the  actual  noise  and  a  dispersion-effect  noise  that 
addresses  the  variation  among  targets  within  a  given  class,  as  shown  in  Figure  16.20  A 
subset  of  this  problem  might  involve  grouping  the  various  models  (e.g.,  F-15C,  F-15E)  or 
configurations  (e.g.,  with  or  without  stores)  of  an  aircraft  into  a  composite  aircraft 
category  (e.g.,  F-15).  This  would  likely  result  in  less  dispersion  within  such  a  generic 
airframe  category  than  would  occur  across  a  broader  class  of,  say,  all  two-engine  fighters. 
The  actual  or  synthesized  target  template  data  and  the  variations  in  the  individual  target 
template  dot  products  among  the  various  targets  will  have  to  be  examined  to  detennine 
which  target  groupings  make  most  sense. 

In  a  similar  manner  to  that  shown  in  Figure  16,  one  could  increase  the  values  for 
the  terms  in  the  target’s  covariance  matrix  to  account  for  target  fluctuations.  In  this  case 
the  terms  in  the  noise  covariance  matrix  would  correspond  to  the  sum  of  the  squares  of 
standard  deviations  for  both  the  inherent  system  noise  (e.g.,  thermal  noise)  and  the  target 
fluctuations. 


Figure  15.  Formation  of  Target  Class  Templates 


Because  there  will  typically  be  fewer  targets  in  a  class  than  components  in  the  template  vectors,  the 
effective  noise  for  the  class  will  no  longer  appear  as  a  hyper-sphere  (despite  the  depiction  in  Figure 
15).  Hence  the  noise  covariance  matrix  associated  with  the  target  class  will  become  a  hyper-ellipse, 
which  would  necessitate  modifications  to  the  formulations  shown  in  Chapter  V,  beginning  with  (18). 


VI-4 


Figure  16.  “Noise”  for  Target  Class 

E.  EVALUATION  OF  NON-ISOTROPIC  NOISE 

When  the  noise  covariance  is  non-isotropic,  the  results  presented  in  Section  5 
require  modification.  The  noise  could  be  non-isotropic  because  the  nominal  target  is  a 
composite  of  a  number  of  similar  targets,  as  just  illustrated  in  Figure  15,  or  due  to  the  use 
of  noncoherent  processing.  For  example,  with  noncoherent  processing  when  the  target 
template  is  (  1,  0  )T  and  the  SNR  equals  1,  the  expected  template  is  (  1.545,  1.253  )T  and 
the  corresponding  standard  deviations  are  0.783  and  0.655.  So  clearly  noncoherent 
processing  can  lead  to  ellipsoidal  rather  than  spherical  noise  matrices.  Figure  17  shows  a 
generic  depiction  of  this  situation. 

The  test  statistic  (14)  includes  a  noise  covariance  matrix  that  will  therefore  adjust 
for  non-isotropic  noise.  The  Fisher  Linear  Discriminant  [8]  provides  an  alternative  way 
to  think  about  the  problem.  It  allows  one  to  determine  the  maximum  separation  between 
two  data  sets  based  on  the  separation  between  the  means  of  the  two  populations  and  the 
combined  covariance  of  the  two  populations.  The  Fisher  Linear  Discriminant  is  given  by: 

U  =  c;'(»il  -m2)  (49) 

where: 

Vs  is  the  vector  for  maximum  statistical  separation 

Cc_1  =  Inv(Cc)  where  Cc  =  Cj  +  C2 

Ci  is  the  covariance  associated  with  population  1 
C2  is  the  covariance  associated  with  population  2 
ni\  is  the  mean  of  population  1 
m2  is  the  mean  of  population  2 
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Figure  17.  Non-isotropic  Noise  Case 

The  Fisher  Linear  Discriminant  uses  the  inverse  of  the  combined  covariance 
matrix  to  rotate  the  vector  between  the  population  means  to  produce  maximum  statistical 
separation. 

Our  problem  involves  deriving  the  entries  in  a  confusion  matrix.  For  each  row  of 
the  confusion  matrix,  the  true  target  is  presumed  to  be  known.  Therefore,  we  would  have 
to  modify  the  Fisher  Linear  Discriminant  process  slightly  for  our  needs.  Specifically, 
when  evaluating  a  row  of  the  confusion  matrix,  only  one  covariance  matrix  applies  (the 
one  for  the  presumed  target).  The  means  could  then  be  the  locations  of  the  mean  for  the 
presumed  target  and  the  locations  of  the  means  for  each  of  the  other  targets  included  in 
the  confusion  matrix.  A  number  of  such  pair-wise  decisions  would  essentially  put  a 
hyper-box  around  the  presumed  target  and  the  probability  of  being  within  that  hyper-box 
would  define  the  diagonal  term  on  the  confusion  matrix.  The  sides  of  the  hyper-box 
would  be  defined  by  the  separation  vector,  Vs,  but  a  constant  would  need  to  be  set  to 
define  the  location  of  each  of  these  hyper  planes.  The  constant  could  be  defined  so  the 
plane  passes  through  the  midpoint  of  the  line  separating  the  target  means.  Alternatively,  a 
no-declare  buffer  region  could  be  established  between  each  target  pair  to  reduce  the 
chances  of  false  reports.  The  off-diagonal  terms  would  be  found  by  calculating  the 
probability  that  the  observed  return  for  a  target  would  be  outside  the  hyper-box  and 
closer  to  the  other  targets. 

F.  DISCRIMINATION  AMONG  TARGETS  WITHIN  A  CLASS 

One  might  wish  to  implement  a  two-stage  process  in  which  a  target  is  first 
evaluated  against  a  class  (to  take  advantage  of  the  potentially  better  orthogonality  among 
class  templates)  and  then  determine  the  specific  target  type  within  a  class.  The  prior 
section  described  the  approach  to  the  first  stage,  but  a  different  process  appears  warranted 
for  the  second  stage.  The  idea  here  is  to  remove  as  much  of  the  common  information 
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from  the  individual  vectors  as  possible  and  then  test  for  differences  among  the  resulting 
modified  vectors.  Figure  18  depicts  this  process.  The  left  part  of  the  figure  shows  the 
original  arrangement  of  the  target  vectors  that  make  up  the  target  class,  the  composite 
vector  used  to  represent  the  entire  class,  and  the  error  ellipse  due  to  sensor  noise  for  one 
of  the  target  vectors.  The  right  part  of  the  figure  shows  the  resulting  vectors  after  the 
composite  vector  has  been  subtracted  from  each  of  the  individual  vectors.  The  resulting 
modified  vectors  have  clearly  become  more  orthogonal.  However,  if  the  modified  vectors 
are  re-normalized,  then  the  noise-error  ellipse  will  become  larger  as  also  shown  in  the 
figure. 

This  process  has  made  the  problem  of  discrimination  among  the  individual  targets 
within  a  class  amenable  to  the  analysis  procedures  described  above  because  of  the  near 
orthogonality  of  the  modified  vectors.  Because  of  the  renormalization  of  both  the 
modified  vectors  and  the  size  of  the  error  ellipse,  however,  the  analysis  process  may 
require  substantially  higher  SNRs  to  distinguish  individual  targets  within  a  class.  The 
feasibility  of  this  target-type  discrimination  will  therefore  depend  upon  the  degree  of 
difference  among  the  vectors  within  a  class  and  the  signal-to-noise  levels. 


Original  Relation  of  Targets 


Relation  of  Targets  Vectors  alter  Subtracting 
the  Target  Class  Vector  from  Each 


Figure  18.  Effect  of  Removing  the  Target  Class  Vector 

G.  DEALING  WITH  A  MIX  OF  HIGHLY  CORRELATED  AND  NEARLY 
UNCORRELATED  TARGETS 

One  might  sometimes  encounter  a  situation  similar  to  the  one  depicted  in  Figure 
19.  In  this  case,  two  of  the  targets  have  nearly  aligned  template  vectors  (and  are  therefore 
highly  correlated),  but  the  third  has  a  nearly  orthogonal  template  vector  (and  is  therefore 
nearly  uncorrelated  with  the  other  two).  The  analysis  procedure  described  in  Section  V 
generally  assumes  that  the  individual  target  vectors  are  uncorrelated,  that  is,  f(x,  y,  z)  = 
flx)-f{y)-f{z).  In  this  case,  which  involves  a  mix  of  highly  correlated  and  nearly 
uncorrelated  targets,  the  joint  probability  distribution  would  only  partly  factor,  such  that 
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fix,  v,  z)  =  fix,  y)-fiz).  Therefore,  in  this  case  the  general  equation  (3)  could  be  rewritten 
as 


P(X  |  Tj)  =  Pix  >  y  and  x  >  z) 

CO  X 

=  \dx  1  fTiix,y)FTiix)dy 

—00  —00 


(50) 


because 


f  fTi  (■ x,y)fT.  (z)dz  =  fT  (x,y)FT  (x) . 

J— oo 


Following  dot 
products  apply: 


x  y 

TX  TZ*  0 
Ty-Tz*  0 


(51) 


Figure  19.  Mix  of  Highly  Correlated  and  Uncorrelated  Targets 

This  procedure  may  prove  useful  if  the  majority  of  the  targets  are  uncorrelated 
and  only  a  few  are  highly  correlated.  The  following  provides  a  formulation  for 
representing  the  joint  density  of  two  partially  correlated  Gaussian  random  variables: 


f(x,y)  =  - 


1 

f 

1 

2 

( x-mx )2  2  r{x-mx\y-my)  ( y-my ) 

1 - exp 

27tGxGy\  \  -r 

r 

1 

+  J, 

(52) 


where  mx,  ax,  my,  and  gv  are  the  mean  and  standard  deviation  of  the  random  variables  x 
and  v  and  r  is  the  correlation  between  the  two  random  variables.  This  is  actually  the  two- 
dimensional  version  of  the  multidimensional  Gaussian,  (25),  given  previously. 


H.  EFFECT  OF  DIMENSIONAFITY  OF  TARGET  TEMPEATE  VECTORS 

The  dimensionality  of  the  target  vectors  affects  the  ability  to  differentiate  between 
targets  for  the  noncoherent  case  for  the  reasons  described  below.21  To  address  this  issue, 
consider  the  simplest  case  of  target  vectors  S i  =  (1  0)r  and  53  =  (a  b)T. 


21  The  goal  here  is  to  offer  a  simple  demonstration  of  a  potential  problem  rather  than  to  provide  a 
rigorous  analysis  involving  the  use  of  SNR-dependent  target  templates  as  discussed  in  Chapter  V.C. 
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Making  IS2I  =  1  requires  a  <  1  and  b  =  yll-a2  .  The  correlation  between  these 
vectors  is  S1S2  =  a,  and  the  magnitude  of  their  difference  is  \S\  -  Sy  =  ^2(\  -  a) . 

Assuming  the  noise  value  is  s  «  1,  the  first  component  of  the  expected  measurement 
vector  of  target  1,  that  is,  <  Z\  >,  can  be  approximated  by  1,  and  the  second  component  is 
given  by  s  =  Jfcr  because  there  is  no  signal  in  this  measurement  bin  and  the  mean  of  the 

Rayleigh  distribution  applies.  Hence,  <  Zx  >«  (iff .  In  this  case  <Zl>TSl=  1  and 
<Z[  >T  S2  =a  +  eyll-a2  (noncoherent  case),  so  the  dot  product  between  the  latter  two 

vectors  is  a  function  of  the  SNR.  Also  note  that  for  coherent  processing,  <  Z\  >  =  (1  Of, 
such  that  <ZX>T  Sx=  1  and  <  Zx  >T  S2  =  a  (coherent  case). 

Hence,  the  dot  product  between  the  expected  measurement,  <  Z\  >,  and  the 
template  Si  varies  with  SNR  for  the  noncoherent  case  but  is  invariant  with  SNR  for  the 
coherent  case. 

Now  to  consider  higher  dimensions  (i.e.,  n  dimensions),  assume  the  following 
vectors: 

Si  =  (l  00  00... Of  and  S2  =  (abb  bb...bf,  where  b=  f  l-«2)/(«-l) .  (53) 

As  before,  let  <  Z\  >  be  the  expected  return  when  target  Si  is  present.  This  yields 

<  Z\  >  =  (1  0  0  0  0. .  .Of  for  coherent  processing  and 

<Z  1  >  =  ( 1  s  s  s  s. . .  s)T  for  noncoherent  processing. 

For  coherent  processing  the  dimensionality  does  not  matter  if  the  degree  of 
correlation  between  the  templates  remains  fixed,  since 

<  Z,  >T  Sx  =  1  and  <  Z,  >T  S2  =  a  (coherent  case).  (54) 

However,  for  noncoherent  processing,  one  obtains 

<ZX>T  S{=  1  and  <  Zj  >T  S2  =  a  +  l-a2)(«-l)  .  (55) 

Figure  20  shows  plots  of  (55)  when  a  =  0.6  and  s  =  0.1.  This  value  of  £ 
corresponds  to  an  SNR  of  l/2a2  =  2571.  The  upper  line  corresponds  to  the  first  dot 
product,  which  is  invariant  with  the  number  of  dimensions.  The  second  line,  which 
corresponds  to  noncoherent  processing,  increases  as  the  number  of  dimensions  increases. 
This  means  that  the  likelihood  of  incorrectly  recognizing  target  Z\  as  target  Si  would 
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increase  as  the  number  of  dimensions  increases.22  Of  course,  the  reason  for  increasing  the 
number  of  dimensions  in  the  target  vectors  would  be  to  enhance  the  ability  to 
discriminate  between  two  targets.  Figure  21  shows  how  much  decorrelation  between  the 
targets  would  be  required  to  keep  the  chances  of  confusing  target  Z\  with  target  Si 
constant  when  performing  noncoherent  target  processing.  The  lower  correlation  curve  in 
Figure  21  begins  at  a  correlation  of  0.6  for  a  two-dimensional  case  and  falls  to  0.057  for  a 
40-dimensional  case.  However,  this  reduced  template  correlation  would  lead  to  reduced 
target  confusion  if  one  were  to  perform  coherent  processing.  In  fact,  the  dot  product  for 
the  coherent  case  would  follow  the  lowest  curve  in  Figure  21  (i.e.,  s'  S2  =<  Zx  >T  S2  in  the 

coherent  case).  Of  course,  the  relationships  depicted  in  Figs.  20  and  21  would  change  for 
different  SNR  assumptions  (i.e.,  different  values  of  s)  and  target  vectors  different  from 
those  assumed  in  (53),  and  the  vectors  in  (53)  no  doubt  represent  a  worse  case.  For 
example,  consider  the  following  noncoherent  case: 

51  =  (  1  0  1  0  1  0...)r  Jlfn  (56a) 

52  =  (  0.6  0.8  0.6  0.8  0.6  0.8...) T  (56b) 

< Z\  >  ~  (  1  s  1  e\  s...)T  y/2/n  (56c) 

where  the  number  of  components  in  each  vector  is  assumed  to  be  even.  In  this  case,  the 
correlation  between  the  two  templates  is  given  by  S i  1  Si  =  0.6,  and  the  test  statistics 
become  <  Z\>  S\  =  1  and  <  Z\>  Si  =  0.6  +  0.8  s.  Therefore,  although  in  this  case  the 
chance  of  confusing  a  measurement  of  target  1  with  target  2  does  not  depend  on  the 
dimensionality  of  the  vectors,  it  still  depends  on  the  size  of  the  noise  terms. 

Both  the  cases  shown  in  the  figures  and  the  numerical  example  involving  (56) 
demonstrate  a  fundamental  difference  between  the  coherent  and  noncoherent  cases  and  a 
need  for  caution  when  considering  noncoherent  processing. 


22  The  reason  that  the  Z\*S2  curve  exceeds  one  is  that  our  methodology  does  not  include  normalization  of 
the  measured  vector,  Z\.  Also,  because  we  are  evaluating  the  noncoherent  case,  we  can  drop  the 
magnitude  so,  for  instance,  |<Z1>-iS2]  =  <Z\>-S2. 
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Figure  20.  Effect  of  Dimensionality  on  Dot  Products  (Constant  Correlation) 


0  10  20  30  40  50 

Number  of  Terms  in  Target  Vector 


Figure  21 .  Effect  of  Variable  Correlation  on  Vector  Dot  Products 
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VII.  CONCLUSIONS 


This  paper  provides  a  basis  for  the  generation  of  confusion  matrices.  It 
demonstrates  how  to  use  target  templates  to  derive  traditional  square  confusion  matrices 
that  have  identical  row  and  column  labels  and  how  to  generate  rectangular  matrices  that 
typically  include  a  no-declare,  or  no-decision,  column.  The  paper  suggests  alternative 
criteria  for  generating  the  no-declare,  or  no-decision,  column.  The  framework  for 
generating  confusion  matrices  presented  herein  could  help  ID  system  developers  assess 
issues  such  as  the  following: 

•  What  are  the  relative  advantages  of  one-dimensional  versus  two-dimensional 
(or  three-dimensional)  target  imaging? 

•  What  are  the  relative  benefits  of  higher  versus  lower  resolution 
measurements  (e.g.,  one-dimensional  HRR  profiles  that  use  higher 
bandwidth  to  achieve  more  samples  along  the  length  of  a  target)? 

•  Because  one-dimensional  HRR  profiles  will  vary  depending  upon  the 
viewing  angle  of  the  target,  what  are  the  trade-offs  involved  in  having 
separate  templates  for  a  few  or  a  large  number  of  different  viewing  angles? 
(For  example,  the  target  returns  within  a  narrow  set  of  viewing  angles  will 
likely  be  highly  correlated,  while  those  within  a  wider  set  of  angles  will 
begin  to  become  uncorrelated,  but  will  likely  have  greater  variability  due  to 
scintillation  effects.) 

•  How  worthwhile  is  it  to  preserve  phase  information  when  generating  target 
templates  and  making  target  measurements? 

We  address  the  first  of  these  issues  through  a  conventional  approach  of  treating 
any  target  (template  or  measurement)  as  a  vector  of  data  points  regardless  of  whether  the 
process  involves  one-dimensional,  two-dimensional,  or  higher  diagonal  target 
measurements.  These  vector  representations  provide  a  convenient  way  of  understanding 
and  evaluating  the  degree  of  correlation  between  targets,  which  has  been  shown  to  be 
directly  related  to  the  inner  product  of  the  vector  representations  for  the  various  targets 
(assuming  the  vector  representations  have  been  normalized  to  one).  Moreover, 
representation  of  one-,  two-  (or  higher)  dimensional  targets  as  vectors  leads  to  the 
calculation  of  the  conditional  probabilities  appearing  in  the  confusion  matrices. 
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The  paper  considers  coherent  and  noncoherent  processing  and  correlated  and 
uncorrelated  targets.  For  the  case  of  coherent  processing  of  uncorrelated  targets,  the 
paper  demonstrates  the  process  for  generating  universal  confusion  matrices — that  is, 
matrices  that  do  not  depend  upon  the  details  of  the  targets  so  long  as  the  targets  are 
uncorrelated.  One  winds  up  obtaining  Rician  random  variables  because  of  the  need  to 
compare  the  magnitudes  of  the  test  statistics  (i.e.,  R,k  =  7^1).  However,  because  the 
individual  test  statistics  are  uncorrelated,  the  required  joint  probability-distribution 
factors  are  easily  obtained  by  multiplying  the  probability  distributions  associated  with  the 
individual  test  statistics.  Although  evaluation  of  cases  involving  uncorrelated  targets  and 
coherent  processing  is  admittedly  more  of  a  theoretical  than  a  practical  case,  this 
calculation  provides  insight  into  the  upper  limit  of  ID  perfonnance  one  can  achieve  as  a 
function  of  the  SNRs  associated  with  measured  targets. 

We  have  illustrated  the  reduced  likelihood  of  being  able  to  identify  targets  as  they 
become  partially  correlated.  In  fact,  Figure  7  shows  that  there  is  very  little  error  in 
analyzing  targets  as  if  they  are  uncorrelated  if  their  actual  correlation  is  less  than  50 
percent.  To  address  partial  target  correlation,  one  requires  the  joint  probability  functions 
for  partially  correlated  Rician  random  variables. 

The  paper  demonstrates  by  means  of  simulation  that  the  conditions  under  which 
the  multidimensional  Gaussian  distribution  adequately  approximates  the  required  joint 
probability  function.  In  those  cases  where  the  Gaussian  approximation  would  not  hold,  it 
would  always  be  possible  to  construct  the  joint  PDF  using  numerical  techniques. 

The  paper  also  provides  a  methodology  for  performing  noncoherent  target 
processing.  These  latter  cases,  involving  correlated  targets  and  noncoherent  processing, 
require  application  of  the  central  limit  theorem  (i.e.,  normal  approximations)  to  the 
Rician  distribution.  The  correlated,  noncoherent  target  cases  reveal  an  interesting  bias 
toward  the  declaration  of  targets  that  are  closer  to  having  a  signature  unifonnly 
distributed  over  the  measurement  space  as  the  SNR  falls.  This  occurs  if  we  simply  apply 
our  linear-statistic  approach  (16a)  without  modification  when  the  SNR  is  low.  However, 
if  the  templates  for  the  noncoherent  case  are  adjusted  as  a  function  of  the  SNR  to  reflect 
the  expected  return  from  a  target,  then  this  target-selection  bias  goes  away.  The  paper 
further  suggests  how  our  process  could  be  refined  to  deal  with  non-isotropic  noise 
associated  with  either  noncoherent  processing  or  composite  target  templates 
corresponding  to  a  target  class.  Also,  we  have  shown  that  the  noncoherent,  correlated 
case  can  actually  get  worse  as  the  number  of  elements  in  the  measurement  vector 
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increases,  unless  the  added  dimensionality  leads  to  a  calculable  reduction  in  the 
correlation  between  the  targets. 

Finally,  the  vector  representation  of  target  templates  and  measurements  provides 
a  useful  way  to  think  about  the  issues  associated  with  making  target  class  versus  target 
type  decisions. 
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APPENDIX— PROOFS 


This  appendix  provides  proofs  of  the  mathematical  claims  made  in  the  text. 
Throughout  the  appendix  the  superscript  H  is  used  to  denote  the  Hermitian  conjugate 
(complex  conjugate  transpose)  of  a  complex  vector  or  matrix. 


A-l.  Derivation  of  (8) 

While  (8)  may  be  obvious,  the  following  few  equations  further  demonstrate  why 
it  is  true.  For  the  threshold  condition  Xk  >  a ,  the  probability  of  observing  Xk,  given  true 
target  tb  is  given  by  (5)  with  the  lower  limit  on  the  final  integral  over  xa  replaced  by  a: 


P{Xk\Tt)  =  \fXt{xk)dxk 


7=1 


Fxjxk) 


(Al) 


Next,  examine  the  sum  the  sum  of  P(Xi,\U)  over  all  target  templates  Xk\ 

ST'  fxt  ( xk  ) 

*XjKxk) 

7=1 


n  w  n  n 

XP(WIT)  =  in* 

k=  1  n  7=1  k= 1 


k~tFxk(xk) 


-dxt . 


(A2) 


Now  recognize  that  the  integrand  can  be  written  as  the  derivative  of  the  product 
of  all  cumulative  functions,  that  is, 


dxk 


j= 1  7=1  A-=l  PXt(xk) 


(A3) 


It  follows  that 


72  w  72 

2>(WIT)  =  p  riFT(A) 

^=1  a  V 


= n  Fxj  (°°) _  n  fx/«) = 1  ii  ,  («) 

7=1  7=1  7=1 


(A4) 


where  we  have  used  the  fact  that  cumulative  probability  functions  are  unity  at  infinity. 
The  sum  over  all  states,  including  the  no-declare  state,  must  be  unity,  that  is 

72 

Yjr(Xk\Ti)+P(ND\Ti)  =  \.  (A5) 

k= 1 
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From  this  and  (A4)  one  arrives  at  (8)  in  the  paper, 

n 

P(ND\Ti)  =  Y\Fxia) .  (A6) 

j= i 

Setting  a  =  -co  restores  the  no-threshold  result  P(ND |f)  =  0,  as  required. 

A-2.  Vector  Relations 

In  this  section  we  derive  several  statistical  results  concerning  template,  noise,  and 
signal  vectors,  and  we  point  out  results  that  are  mentioned  in  the  paper. 

Let  S  be  an  //-dimensional,  complex-valued  template  vector  with  nonn  1 : 

S a  —  xa  +  iy a  ,  cc  —  1,2,'  •  *,  // 

\S\2=SHS  =  Yi(<x2a+y2a)  =  \ 

a= 1 

Let  N  be  an  /7-dimensional,  complex-valued  noise  vector  whose  real  and 
imaginary  components  are  independent  zero-mean  Gaussian  random  processes  with 
standard  deviation  cr. 

Na  =ua  +  iva  ,  a  =  1,2, •••,/? 

<  Ua  >=<  va  >=  0 

<  uaup  >=<  vavp  >=  cr28ap 
<uav p  >=  0,  for  all  a  and  /? 

Let  Z  be  a  complex-valued,  /7-dimensional  signal  vector  given  by  Z  =  S  +  N . 

Some  useful  relations  follow. 

a.  <  |iv|2  >=<  NhN  >=  2/7 cr 2  . 

Proof: 

<NhN>=Yj<  N*aNa  >=^<//2  +v2  >  =  2/7  cr 2  . 

a= 1  a= 1 

b.  Cov(Ar)=<  NNh  >=  2a1 1 ,  where  I  is  the  /7-dimensional  identity  matrix.  This  is 
(12)  in  the  paper. 

Proof: 

<  NaN/3  >  =  <  K  +  iva)(Ufl  ~iVp)>=<  UaUp  +  VaVp  >  =  'Ic,2  8 ap  . 

C.  <\shn\2  >  =  2a1 . 
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Proof: 


<  \sH >  =  <  ( ShN)(ShN )*  >  =  <  ShNNhS  >  =  SH  <  NNh  >S  =  2a2. 

d.  <|Z|2  >=1  +  2 na2. 

Proof: 

<|z|2  >  =  <|S  +  Aj2  >  =  <(SH  +Nh)(S+N)>  =  <ShS  +  ShN  +  NhS  +  NhN> 

=  1 +  SH  <N>  +  <NH  >S+<  NhN  >=  1  +  2»  cr2 . 

e.  <ZZ"  >=SSH  +2cr2/  . 

Proof: 

<zz"  >  =  <(S'  +  A%S'  +  A0//  >  =  <  SSH  +SNH  +NHS  +  NNH  >  =  SSH  +2cr2I . 

f.  <|1S'"z|2  >  =  1  +  2cr2  . 

Proof: 

<  \sH z|2  >  =  <  (SHZ)(SHZ)H  >  =  SH  <  ZZH  >  S  =  1  +  2cr2  . 

A-3.  Rayleigh  Probability  Density  Function 

Here  we  show  that  the  amplitude  of  the  components  of  the  noise  vector  N  have  a 
Rayleigh  PDF.  Consider  a  component  of  N  given  by  u  +  iv.  Since  u  and  v  are 
independent,  zero-mean  Gaussians,  their  joint  PDF  is  given  by  the  two-dimensional 
Gaussian: 

(«2+v2) 

/,(“’v)  =  we  2ct"  •  (A7) 

Switching  to  polar  coordinates  and  accounting  for  the  Jacobian,  one  obtains  an 
element  of  probability  in  terms  of  the  joint  PDF  expressed  in  polar  coordinates, 

r2 

pP{r,0)rdrd0  =  —^e  2a~  rdrdO  .  (A8) 

2  TIG  ^  ' 

Integrating  over  all  9  gives 

PRAY(r)dr  =  ^pF(r,0)ddrdr  =  -^e  2a~  dr  (A9) 
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where  we  identify  />rayO  as  the  Rayleigh  PDF.  It  is  straightforward  to  show  that 
<  r  >=  Jfcr  and  <  r2  >=  2a2 ,  from  which  the  standard  deviation  is  given  by 

yl<r 2  >  -  <  r  >2  =  ^2-y<7  . 

A-4.  Rician  Probability  Density  Function 

In  this  section  we  prove  that  the  amplitude  of  each  component  of  the  signal  vector 
Z  has  a  Rician  PDF.  We  also  present  some  useful  properties  of  the  Rician  PDF.  Consider 
a  component  of  Z  =  S  +  N,  given  by  (x  +  iy)+(u  +  iv)  =  a  +  ib.  Similar  to  the  above 
analysis,  the  joint  PDF  of  a  and  b  is  given  by 

(a-xf+(b-y)2 

/>(«,£)  =  2-  .  (A10) 


Letting  a  =  re  os  ft  b  =  rsinft  x  =  dsinft  y  =  dsinft  an  element  of  probability  in 
polar  coordinates  (r,ft)  becomes 

(r2+A2-2rAcos(O-0)) 

2a-  rdrdO  .  (All) 


pP(r,9)rdrdO  =  — x—e 

27TCJ 


Integrating  over  all  ft  gives 


*£71 

Pmdr)dr  ~  J  Pp{r,9)d9rdr 
) 

1 


(r2+A2)  rAcos(6-(/)) 


(A12) 


2ncj 


dOdr 


Since  the  ft  integral  is  over  all  angles,  it  is  independent  of  the  angle  (p  and  is  given 

by  [9] 

rA  cos (0-^) 

-  d0=2n  /0(^y)  (A13) 

where  Io  is  the  zeroth-order  modified  Bessel  function  of  the  first  kind.  Finally,  the  Rician 
PDF  is  given  by 

( P+A 2) 

Pr jc(r)  =  $e  2-  Io0).  (A  14) 

Note  that  for  A  =  0,  the  Rician  density  reduces  to  the  Rayleigh  density  appearing 
in  (A9). 

The  mean  value  of  the  Rician  is  found  from  Mathematica  to  be 

+  +  (A  15) 
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where  I\  is  the  first-order  modified  Bessel  function  of  the  first  kind.  The  mean  can  also  be 
expressed  in  terms  of  a  confluent  hypergeometric  function  [10].  The  Bessel  function 
representation  lends  itself  to  obtaining  expressions  for  the  mean  value  for  large  and  for 
small  values  of  A  for  high  and  low  SNR.  When  A/cr>  I  the  modified  Bessel  functions  can 
be  approximated  by  their  asymptotic  expansions  for  large  arguments  [5]: 


7°W~ 

(A  16a) 

(A  16b) 

Using  these  expressions  in  (A  15),  one  finds  an  expression  for  the  mean  of  a 

2 

Rician  for  high  signal  to  noise  to  order  (cr/4)‘: 

<r>*A(  l  +  fjr). 

(A  17) 

For  low  signal  to  noise  (A/a<  1),  the  Bessel  functions 

can  be  expanded  around 

zero  [9]: 

I0(x)  =  1  +  — +  ■■■ 

(A  18a) 

,  ,  ,  V  X3 

,W“2+l6+''' 

(A  18b) 

After  expanding  the  exponential  in  (A  15),  substituting  the  approximations  (A  18) 
into  (A  15),  and  retaining  terms  to  order  A2/ cf,  one  finds 

(A19) 

2 

The  second  moment  of  the  Rician  is  computed  easily  from  the  definition  of  r  : 

<  r2  >=<a~  +  b 2  >=<  (x  +  u)2  +  (y  +  v)2  >=<  A2  +  2(xu  +  yv )  +  u 2  +  v2  >  (A20) 

where  the  averages  are  performed  over  the  Gaussians  u  and  v.  Only  the  even  terms  in  u 
and  v  survive,  giving 

<r2  >- A2  +2a"  .  (A21) 

If  the  parameter  A  has  been  normalized  to  one,  then  <r2  >=1  +  2 a2 ,  as  appears  in 
(19)  in  the  text.  The  variance  of  the  Rician  is  given  by 

<Tric  =<  r2  >  -  <  r  >2  .  (A22) 

For  high  signal  to  noise  the  variance,  through  (A22),  (A21),  and  (A  17),  becomes 

°ric  ~  0-2 (1  - (A23) 

valid  to  order  (cr/A)  . 
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Finally,  it  is  useful  to  find  conditions  under  which  the  Rician  PDF  can  be 
approximated  by  a  Gaussian  PDF.  Starting  with  the  Rician  PDF  in  (A  14)  and  using  the 
asymptotic  expansion  of  /o  for  large  arguments  (A  16a),  one  has  to  leading  order 


y°Ric(r)  ~  J~A 


Jljra 


Q--AY 

2a2 


(A24) 


The  term  in  parentheses  is  recognized  as  a  Gaussian  with  mean  A  and  standard 
deviation  cr.  When  cr  is  sufficiently  small,  the  density  is  appreciable  only  near  r  =  A;  in 
this  event,  the  prefactor  Jr/A  is  of  order  unity,  and  the  density  is  approximately 

Gaussian.  Recapping,  the  conditions  for  the  Rician  PDF  to  be  approximated  by  a 
Gaussian  are  <j!A  «  1  and  r  ~  A. 


A-5.  Coherent  Processing  Results 


In  this  section  we  derive  the  main  coherent  processing  results  mentioned  in  the 
text.  We  start  with  the  covariance  matrix  result  stated  near  the  conclusion  of  Section  IV 
[after  (17)].  For  each  true  target  St,  fonn  the  signal  vector  Z,  =  S,  +N  and  the  n  complex 
random  variables  Tik  =  Sk  Zt  k  =  1,2,...,/?.  The  covariance  matrix  is  given  by 


CJk  =((Tir<  Ty  >)\Tik-<Tik>)). 


(A25) 


Now,  <  Tik  >=<  sj Zt  >=<  sj Sj  +  sj  N  >=  sj* St ,  and  similarly  <  A  >=  S  'Sr  There¬ 


fore, 


Cjk  =((SfN)\SkN))  =  S k  <NNn  >S /  =  2cr~Sk  S 


H  ^  \t\tH 


2  VH  { 


(A26) 


where  the  last  equality  follows  from  relation  A-2b  above.  Note  that  the  covariance  matrix 
is  independent  of  the  true  target  St .  If  the  targets  Sk  and  Sj  are  orthogonal,  then 

Cjk  =  2a1 8 jk  [(18)  in  the  text]. 


It  is  shown  below  that  when  the  signal  and  template  vectors  are  matched,  the 
amplitude  Rn  =|  Tu  \  is  a  Rician  random  variable  with  parameters  A  =  1  and  cr.  Then,  from 
(A21)  <  r\  >=  l  +  2<j2 .  If  the  targets  Sk  and  .S',  are  independent,  then  the  amplitudes  Rlk 
and  Rj  are  independent  with  joint  expectation  given  by 

<RljRlk>=0  +  2a2)8Jk.  (All) 


This  is  (19)  in  the  text. 

We  now  show  that,  in  general,  the  amplitudes  Rik=\Tik\  are  Rician  random 
variables.  Decomposing  Tik  into  real  and  imaginary  parts,  Tik  =P  +  iQ  ,  one  finds 
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P=Re(sfZ,)=Re(sf(,S,+JV)) 


Re(S"Sf )  +  Re  -  ,Ya )(«a  +  iva ) 


=  Re(SfSJ  +  £(Zaua +!>„). 


(A28) 


Similarly, 


Q  =  Im(sf  Z,  )=  Im (S?S, )  +  ^(Xava  -  ) . 


(A29) 


P  and  Q  are  then  real  Gaussian  random  variables;  their  means  and  variances 
follow.  Thus,  <  P  >=  Re(sj* Z,-)  and  <  0  >=  Im(.S'//Z,).  Then 


a2  =<(/>-<  P>)2>=( 


+  Yava'KXfiUfi  +  Yfiv/l) 


■^r~'  X aX p  <  uaiip  >  +XaYp  <  uaVp  > 
+  4^0  <  vaWg  >  +YaYp  <  vaVp  > 


=  <72£(Xa2+Fa2)  =  <72 


(A30) 


where  the  final  result  follows  from  the  normalization  of  Sk .  A  similar  calculation  yields 


cr2g=<(Q-<Q>)2>=a2. 


(A31) 


It  is  also  straightforward  to  verify  that  a2o  =<  (P-  <  P  >){Q-  <{?>)>=  0 ,  from 

which  it  follows  that  P  and  Q  are  independent  Gaussian  random  variables.  Hence,  the 
joint  distribution  of  P  and  Q  is 


Pj(P,Q)  =  -—e 
2  KG 


(A32) 


which  is  identical  in  form  to  (A  10).  We  conclude  that  Rik  =  \Tik  |  =  yp2  +Q2  has  the  Rician 


(W) 


fik{Rik)  =  ^e  2-2  /0(^) 


(A33) 


where  cik  =\  SkSt  \ .  This  is  (24)  in  the  text.  For  i  =  k  (perfect  overlap),  cik  =  1,  whereas  for 
independent  targets,  cik  =  0 ,  and  the  Rician  reduces  to  a  Rayleigh  PDF  (21) 


A-7 


For  correlated  targets,  the  joint  expectation  <  RyRlk  >  cannot  be  computed  exactly; 
however,  an  approximate  relation  can  be  found  to  order  a2,  as  outlined  in  the  following. 


<RyRik  >  =  <\ Sfzt  ||  sj*Zt  \>  =  <\SfziZ?Sk  \>  =  <\Sf(Si  +  N)(Si  +  N)HSk  \> 
=<\  sf  +SfS;NHSk  +SfNS?Sk  +SfNNHSk  \> 


(A34) 


Defining  D  =  sf  StSf Sk ,  E  =  s"  S,N/l  Sk ,  F  =  sfNSfsk,  and  G  =  S?NNHSk,  the 

joint  expectation  becomes 

<  R,jRjk  >  =  <  yj( D  +  E  +  F  +  G)(D  +  E  +  F  +  G)*> 

=<{\D\2  +|£|2  +|F|2  +  |G|2  +DE*  +  DF*  +  DG*  +  ED* 

i  (A35) 

+  EF*  +  EG*  +  FD*  +  FE*  +FG*  +  GD*  +  GE*  +  GF*}2  > 


In  anticipation  of  expanding  the  square  root,  we  retain  terms  up  to  quadratic  order 
in  the  noise  vector, 

J. 

<  RijRik  >~<(\D\2  +K  +  L)2>  (A3  6) 

where  K  contains  the  first-order  terms  in  N  and  L  contains  second-order  terms  in  N,  with 

K  =  DE*  +  ED*  +  DF*  +  FD*  (A37a) 

L  =  \E\2  +\F\2  +DG*  +GD*  +EF*  +FE* .  (A37b) 

Expanding  the  square  root,  the  joint  expectation  becomes 

<RijRik>«\D\<{\  +  ^{K  +  L)-^{K  +  L)2}>.  (A3  8) 

Since  K  is  linear  in  N,  the  average  over  K  vanishes.  Again,  keeping  terms  up  to 
quadratic  order  (A3  8)  becomes 

<ty!„HD|(l+^-^).  (A39) 

Before  computing  the  averages  <L>  and  <K2>,  it  is  beneficial  to  examine 
which  terms  survive  the  averaging  process.  Consider  the  term  FE*  in  <  L  > , 

<  FE*  >  =  <  (Sf  NSf1  Sk  )(Sf  SjNH Sk  )*>  =  <  sfNSkN  >  .  (A40) 

Let  the  noise  vector  be  denoted  by N  =  (rjl,r/2,---,rjll)T  .  Now,  the  average  in  (A40) 
is  an  average  over  a  quadratic  form  in  the  components  of  N, 

n 

<  S'/NSl/N  >=  Ys-ap  <  VaVp  >  (A41) 

a,p= 1 
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where  the  coefficients  Cap  depend  on  the  template  vectors  S  and  Sk .  Since  the 
components  of  N  are  independent  and  <  //„  >=  0 ,  the  only  terms  that  could  potentially  be 
nonzero  are  the  “diagonal”  tenns  <  nl>',  however,  writing  iju  =  ua  +  iva  one  has 

<  nl  >=<  u2a  -  V2a  +  2 iuaVa  >=  0  (A42) 

and  the  average  <  FE*  >  vanishes.  We  conclude  that  all  averages  containing  quadratic 
forms  in  N  or  Nn  vanish,  only  those  average  that  contain  both  N  and  NH  survive.  For  the 
average  over  <L>  the  surviving  terms  are  <|fi|2>,  <|F|2>,  <DG*>,  and  <GD*>, 

which  we  now  compute: 

<  |  E  |2>  =  <  (SfSiNHSk)(SfSiNHSk)*  >=SfStSk  <  NNH  >  SkSfSj 


=  2<j2\sfs^ ,  since  Sk  <  NNH  >Sk=  2a2sfsk  =  2cr2 .  (A43a) 

<  |  F  |2>  =  <  (Sf NS?  Sk)(Sf NSf  Sk)*  >  =  Sf  <  NNh  > 

=  2<72| SkS,\2 ,  since  sf  <  NNH  >  Sj  =  2 cr2sfsj  =  2 a2 .  (A43b) 

<  DG*  >=<(Sf  StSf1  Sk)(Sf  NNH  Sk)*  >  =  SfSiSfSkSf  <  NNH  >  Sj 

=  2(7 2{sfsi){sfsi)\sfsk)\  since  <NNH  >  =2 a2 1  .  (A43c) 

<  GD*  >=[<  DG*  >]*  =  2 cr2(SfSi)\S?Si)(SfSk) .  (A43d) 


Letting. S'"  A  =  clje'l>1 ,  .S".S,  =  cike‘^k ,  and  SHSk  =  ckje1^ ,  such  that,  for  example, 

<| E  \2>=2a2  | sfSt  \2=2cr2(SfSi)\sfSi)  =  2 a2cve~i*vciJei*v  =  2 a2c2 
<  DG *  >  =  2cr 2cijeilplicike~illitckje~iAi  =  2 <72cijcikc^ei^iJ~,l,ik~lli) 

<L>  becomes  <  L  >=<  \  E  |2>  +  <  \F  |2>  +  <  DG*  >  +  <  GD*  > 

=  2cr2(c;2  +  cfk  +  2 Cycikckj  cos {(fy  -  (j>ik  -  </>kj )) .  (A44) 

The  surviving  terms23  of  <  K2  >  are 


23  We  start  with  the  following 

<K 2  >=<  (DE*  +  ED*  +  DF*  +  FD*  )2  >=<{DE*)2  >  +  <(ED*)2  >  +  <(DF’)2  > 

+  <  ( FD *)2  >  +2  <  DE'ED *  >  +2  <  DE’DF *  >  +2  <  DE* FD"  >  +2  <  ED"DF *  > 

+2  <  ED'FD *  >  +2  <  DF*FD *  > 

and  examine  some  of  the  terms  to  demonstrate  which  terms  survive.  For  example: 

<  (i DE-f  >=  (sf  sf{s” sJfcFSjf  <  (S?N)2  >=  (sf  sj(sf  S J  <  S“NS»N  > 

but  the  term  <S^NSkN  >  is  similar  in  form  to  (A41)  and  therefore  equals  zero.  A  similar  argument 
applies  to  the  other  squared  terms.  Since  D,  E,  and  F  are  complex  scalars,  their  order  can  be 
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<K 2  >  =  2<DE*DF*  >  +2  <\D  \2\E  \2>  +2  <\D  \2\F  \2>  +2  <  ED*FD*  >  .  (A45) 

Similar  calculations  to  those  involving  <  L  >  yield 

<  K1  >  =  4 a2 CyCik(c\  +  4  +  2cjCikckj  cos(^.  -  </ik  -  <j>kj )) .  (A46) 

Noting  that  |d|  =  S,  |nw  At  |  =  cifik  ■>  substituting  (A46)  and  (A44)  into  (A3  9)  and 

simplifying,  one  finally  gets 

<  RijRik  >~CijCik  +  —  (cl  +  4  +  2cijCikekj  cos (4  -  4  -  <pkj ))  (A47) 

Zcijcik 

valid  to  order  <j  (actually  to  order  4,  since  the  cubic  terms  vanish).  This  is  (29)  in  the 
text. 

A-6.  Noncoherent  Processing  Results 

In  this  section,  we  derive  the  covariance  matrix  result  for  the  true  target  Ai  [(39) 
in  the  text].  First,  from  the  definition  of  the  signal  vector 

Yi  =\sn  +«1  +  «2  lv,lA„  +nm  l)r  (A48) 

and  it  is  clear  from  the  discussions  in  Section  A-4  that  each  component  of  Yt  is  a  Rician 
random  variable  with  PDF 

(v*2+«ft2) 

fikiyik )  =  ^-e  2ff2  /0(^)-  (A49) 

o" 

For  each  target  vector  Y,  ,  associated  with  true  target  Aj  but  compared  against 
target  template  Ak,  the  test  variables  wik  are  given  by 

m 

wik  =  AJYi  =  X akayia  ,k=\,  2,...,  n.  (A50) 

a= 1 

The  covariance  matrix  elements  are  given  by 

QjkO)  =<(wy- <  wy  >)(wik-<  wik  >)> 

=<WjWik  >-<  Wy  x  wik  > .  (A5 1) 

interchanged  such  that  <  de  ’ ed  *  >=<  dd  ’e’e  >=\d  f<|£  |2>  •  Also,  <de"fd *  >=\D\2<FE*  >=0,  which 
follows  (A41)  and  (A42).  Finally, <  ed  * df  *  >=\D  |2<  ef  *  >=  0  ,  which  follows  from  (A41)  with  N 
replaced  by  A11  or  by  recognizing  that  <  ed  'df  *  >[<  de  ' fd  *  >]*  • 
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Then 


Qjk(i)  = <A/YiAkTYi  >-<  A/Y, ■  x  AkTYi  > 

=  A/  <TT7  >  Ak  -  A /  <  Yt  X  if  >  Ak 
=  A/r«Y,Yir  >-<Yi><  if  >)Ak 

=  AjTCov(Yl)Ak  (A52) 

where  Cov(T )  is  the  covariance  matrix  for  the  signal  vector  Yl .  Since  the  components  of 
Y/  are  independent  Rician  random  variables,  the  covariance  is  diagonal  with  Rician 
variances  on  the  diagonal, 

Cov(Y,  )a/]  =  o]a8ap  (A53) 

with 

4=<)'L>-<A>2  (A54) 

where  the  mean  and  second  moment  of  the  Rician  random  variable  yia  are  given  in  (A  15) 
and  (A21),  respectively. 

Substituting  (A54)  into  (A53)  yields 

n  m 

Qjk(i)—  ja^ia^ap^kp  ~  ja^iaaka  (A55) 

a,y0=l  a= 1 

which  is  (39)  in  the  text. 
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14.  ABSTRACT 


The  reported  work  began  as  an  effort  to  understand  the  fundamental  differences  between  target  identification 
techniques  based  on  one-dimensional  high  range  resolution  radar  and  two-dimensional  range-Doppler-imaging  radar.  The 
work  evolved  into  a  more  general  statistical  methodology  using  target  templates  to  compute  the  entries  in  probability  (or 
confusion)  matrices,  typically  used  to  characterize  combat  identification  (CID)  systems.  Starting  with  a  set  of  complex¬ 
valued  vector  target  templates,  we  present  a  systematic  framework  for  generating  and  comparing  a  set  of  scalar  test 
statistics  and  estimating  their  probability  of  occurrence.  These  probabilities  form  the  entries  of  the  confusion  matrix.  We 
discuss  the  impact  of  random  noise  on  the  CID  process  and  present  the  results  of  applying  the  template/confusion  matrix 
methodology  to  a  few  simple  target  representations.  Specific  attention  is  given  to  examining  the  relative  issues  associated 
with  coherent  and  noncoherent  target  identification.  For  both  coherent  and  noncoherent  cases,  approximate  methods  are 
developed  to  permit  efficient  computation  of  the  confusion  matrix  entries,  and  the  conditions  for  which  these 
approximations  are  valid  are  discussed.  Finally,  we  discuss  several  real-world,  practical  issues  associated  with  the 

template-based  methodology  and  the  CID  process. _ 
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